AD-A243  671 


?  r  ■? 


CSDL-T-1083 

MULTIVARIABLE  CONTROL  OF  THE 
SPACE  SHUTTLE  REMOTE 
MANIPULATOR  SYSTEM  USING 

H2  and  Hoo  optimization 

by 

Om  Prakash  II 
May  1991 


Master  of  Science  Thesis 
Massachusetts  Institute  of  Technology 


Approved  for  public  release; 
distribution  unlimited. 

NASA- JSC  6/5/91 


91-17885 

■  _  _ _  aiii  lesi 


91  I2I3  168 


The  Charles  Stark  Draper  Laboratory.  Inc. 

555  Technology  Square,  Cambridge.  Massachusetts  021 39-3563 


Multivariable  Control  of  the 
Space  Shuttle  Remote  Manipulator  System 
Using  H2  and  Hoo  Optimization 

by 

Om  Prakash  II 

B.S.,  En^neering  Sciences, 

United  States  Air  Force  Academy 
(1989) 

SUBMITTED  TO  THE  DEPARTMENT  OF  AERONAUTICS  AND 
ASTRONAUTICS  IN  PARTIAL  FULFILLMENT  OF  THE 
REQUIRMENTS  FOR  THE  DEGREE  OF 

MASTER  OF  SCIENCE 

at  the 

MASSACHUSETTS  INSTITUTE  OF  TECHNOLOGY 

May,  1991 

(c)  Om  Prakash  II,  1991.  AH  rights  reserved. 


Signature  of  Author _ 

Department  of  Aeronautics  and  Astronautics 

May,  1991 


Certified  by 


Certified  by 


- - 

V  Neil  J.  Adams 

Technical  Staff,  C.  S.  Draper  Labs 
Technical  Supervisor 

W al(a€f  £  _ 

Wallace  E.  VanderVelde 
Professor  of  Aeronautics  and  Astronautics 

Thesis  Supervisor 


Accepted  by 


Professor  Harold  Y.  Wachman 
Chairman,  Departmental  Graduate  Committee 


1 


AoaMsloa  ?or 

NT  13  QUikl  ' 
DTIC  TAB 
Uoarino'U'Kdd 
Just  If  Icat  I  oti. 


•/— - 

r  IBiit.  1  uii/ 

Availflbliity  f 
jA*ali  nar/ 
rDlat  I  Special 


£>r(c  \ 


Multivariable  Control  of  the 
Space  Shuttle  Remote  Manipulator  System 
Using  H2  and  Hqq  Optimization 


Om  Prakash  II 

Submitted  to  the  Department  of  Aeronautics  and 
Astronautics  on  May  10,  1991  in  partial  fulfillment  of  the 
requirements  for  the  degree  of  Master  of  Science 


Ui,  f  yr. Abstract  ,  ^  ^  1 
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Chapter  1 


Introduction 


Future  Space  Shuttle  operations  include  the  on-orbit  assembly  of  the  Space  Station  Freedom 
(SSF).  This  assembly  process  will  require  the  Shuttle  Remote  Manipulator  System  (SRMS)  to 
manipulate  many  separate  elements  of  the  SSF  toward  its  ultimate  construction.  Experience  in 
SRMS  payload  operations  gained  over  the  last  decade  indicate  deficiencies  in  current  operations. 
Some  of  these  deficiencies  are  of  particular  concern  for  the  assembly  of  the  SSF.  One  such 
concern  is  the  long  durations  of  time  required  to  damp  vibrations  of  the  SRMS  and  attached 
payload  after  a  given  maneuver.  Attitude  thruster  firings  of  the  Shuttle  Flight  Control  System 
(SFCS)  during  SRMS  operations  is  another  source  of  undesired  excitations.  The  time  spent 
waiting  for  these  undesired  vibrations  to  damp  out  can  consume  a  significant  fraction  of  the 
total  mission  time.  For  example,  the  estimated  time  spent  waiting  for  the  SRMS  to  damp  during 
the  first  sixteen  Shuttle  flights  required  for  the  SSF  assembly  task  is  illustrated  in  Figure  1.1  [1]. 

The  SRMS  consists  of  six  joints  connected  via  tubular  structural  members  with  a  grapple 
at  the  end  effector  for  grasping  payloads.  Some  of  these  structural  members  are  longer  than 
6  meters.  The  total  length  of  the  extended  arm  is  over  15  meters,  making  for  a  very  flexible 
structure.  Control  of  the  arm  is  performed  by  six  motor  rate  feedback  servos,  one  for  each  joint. 
In  order  to  perform  end  effector  or  payload  maneuvers,  the  SRMS  has  five  modes  of  operation. 
These  modes  include  the  Automatic  Mode,  the  Manual  Augmented  Mode,  the  Single  Joint 
Mode,  the  Direct  Drive  Mode,  and  the  Backup  Drive  Mode.  The  Direct  Drive  Mode  and  the 
Backup  Drive  Mode  are  both  contingency  modes  that  allow  the  SRMS  to  operate  even  if  certain 
system  failures  occur.  After  the  SRMS  completes  a  maneuver  in  one  of  the  three  regular  modes 
each  servo  is  initially  given  a  zero  rate  command.  After  the  zero  rate  command  state,  the  SRMS 
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Flight  number 

Figure  1.1.  Cumulative  SRMS  Damping  Time 

drops  into  Position  Hold.  In  Position  Hold,  as  the  name  implies,  the  system  tries  to  maintain 
its  final  end-point  position  and  attitude.  The  design  philosophy  for  the  SRMS  is  discussed  in 
detail  by  Ravindran  [2]. 

Currently,  the  SRMS  maintains  Position  Hold  end-point  control  using  the  joint  control 
servos.  In  Position  Hold,  each  joint  is  equipped  with  a  controller  that  supplies  joint  rate 
commands  based  upon  angle  feedback  from  joint  encoders.  However,  since  boom  flexibility 
is  not  observable  in  the  joint  angle  feedback  measurements,  no  active  vibration  suppression 
currently  exists  for  control  of  the  SRMS  and  attached  payload.  Simple  movements  of  the  SRMS 
with  a  heavy  payload  attached  can  cause  vibration  of  the  arm  with  frequencies  inside  the  control 
bandwidth.  An  even  larger  source  of  excitation  is  Shuttle  attitude  control  thruster  firings  that 
are  required  to  support  payload  operation  constraints  such  as;  lighting,  contamination,  and 
release  or  tip-off  requirements.  The  system  must  wait  for  these  vibrations  to  die  out  since  it  is 
not  actively  damping  them.  Current  control  system  design  technology  offers  the  opportunity 
to  design  controllers  that  will  actively  damp  such  undesired  vibrations.  The  key  is  to  provide 
such  controllers  with  information  about  the  structural  flexibility  of  the  SRMS. 

Gilbert,  et  al.  [1]  investigated  active  damping  of  the  SRMS  using  a  Linear  Quadratic  Gaus¬ 
sian  (LQG)  controller.  Gilbert  approached  the  problem  by  replacing  the  zero  rate  command 
state  with  a  LQG  controller  after  the  SRMS  finishes  a  maneuver.  Assuming  linear  accelerom¬ 
eters  on  the  end  effector,  this  controller  damps  the  joint  rate  commands  to  a  specified  level 
before  falling  into  Position  Hold.  Gilbert  has  applied  this  technique  to  the  Single  Joint  and 
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Manual  Augmented  Modes  of  operation. 

Engineering  simulations  have  shown  that  Position  Hold  itself  can  be  unstable,  particularly 
with  heavy  payloads  associated  with  the  SSP  assembly  task.  There  are  several  advantages  in 
providing  compensation  for  the  current  Position  Hold  mode.  Since  Position  Hold  is  universal  to 
the  three  standard  control  modes,  improving  its  performance  improves  the  SRMS  over  its  full 
range  of  normal  operations.  Furthermore,  the  joint  defections  are  relatively  small  in  Position 
Hold.  Therefore,  the  nonlinear  plant  associated  with  a  six  degree-of-freedom  robotic  manip¬ 
ulator  is  operating  in  a  fairly  linear  region.  Significant  nonlinearities  come  from  the  servos 
rather  than  the  kinematics  associated  with  the  geometry.  The  servo  nonlinearities  are  easier  to 
characterize  using  linear  control  design  techniques. 

This  thesis  provides  a  survey  of  several  linear  control  design  techniques  as  applied  to  vibra¬ 
tion  damping  and  disturbance  rejection  of  the  SRMS  while  operating  in  Position  Hold.  Rather 
than  damping  the  joint  angles,  the  end  effector  position  and  attitude  are  damped  since  reduc¬ 
ing  oscillations  of  the  payload  itself  is  more  crucial.  Design  methodologies  examined  include: 
Linear  Quadratic  Regulator  (LQR)  synthesis.  Linear  Quadratic  Gaussian  or  H2  optimization, 
and  optimization  derived  compensation.  Designs  are  compared  with  the  current  SRMS 
Position  Hold  controller  in  both  the  frequency  and  time  domains.  Regulators  are  designed  to 
maintain  the  SRMS  end  effector  position  and  attitude  relative  to  an  Orbiter-fixed  reference 
frame  while  being  subjected  to  Shuttle  thruster  firings. 

There  are  several  significant  obstacles  to  overcome  before  linear  control  design  techniques 
can  be  applied  to  this  problem.  First,  the  nonlinear  system  must  be  linearized.  Because  the  arm 
is  essentially  in  a  fixed  configuration  during  Position  Hold,  the  six  servos  of  the  SRMS  are  the 
main  sources  of  nonlinearities.  These  nonlinearities  include:  current  and  torque  limiters,  trim 
integrators,  gearbox  stiffness,  digital  and  analog  tachometers,  and  joint  motor  stiction /frictions. 
The  effect  of  these  nonlinearities  must  either  be  deemed  insignificant,  or  somehow  linearized 
to  a  nominal  value.  One  method  to  model  the  effects  of  the  significant  nonlinearities  on  the 
linearized  system  is  to  represent  them  as  parameter  uncertainties.  In  this  manner,  it  is  possible 
to  perform  robust  stability  analysis  via  ^-analysis  on  the  linear  plant.  This  analysis  will  help 
indicate  if  the  system  is  stable  in  face  of  the  nonlinearities  present  in  the  system. 

These  nonlinearities  that  are  modelled  as  parametric  uncertainties  pose  a  significant  chal¬ 
lenge  to  any  linear  controller  design.  The  current  Position  Hold  system  is  not  robustly  stable 
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with  these  nonlinearities  as  seen  by  its  limit  cycle  behavior.  In  this  thesis,  each  of  the  three 
design  methodologies,  LQR,  and  Hoot  offer  significant  performance  improvement  over  the 
current  system.  However,  none  of  the  designs  achieves  robust  stability.  Because  the  uncertainty 
of  the  system  is  parametric,  not  unmodeled  dynamics,  a  simple  reduction  of  the  bandwidth  will 
not  achieve  robust  stability.  The  outline  of  this  thesis  is  as  follows; 

Chapter  2 

This  Chapter  describes  in  detail  the  Shuttle  Remote  Manipulator  System.  The  Position 
Hold  mode  is  the  focus  of  discussion  to  illuminate  the  advantages  in  improving  its  performance. 
Aside  from  a  physical  description  of  the  SRMS,  the  five  control  modes  are  explored  in  depth: 
the  Automatic  Mode,  the  Manual  Augmented  Mode,  the  Single  Joint  Mode,  the  Direct  Drive 
Mode,  and  the  Backup  Drive  Mode. 

Chapter  3 

This  Chapter  presents  the  development  of  the  SRMS  model.  The  equations  of  motion  are 
derived  for  a  simple  one  joint  case  and  then  extended  to  six  joints.  A  nonlinear  servo  model  is 
presented  and  then  linearized  by  eliminating  those  effects  that  are  negligible.  Two  nonlinearities 
that  have  significant  contribution  are  treated  as  nominal  parameter"-  with  uncertainty.  The 
system  is  placed  in  a  generalized  three-block  form.  Time  domain  simulations  of  both  the  linear 
model  and  a  higher  fidelity  nonlinear  model  of  the  SRMS  are  compared  to  verify  the  linear 
model. 

Chapter  4 

This  Chapter  starts  out  by  examining  the  characteristics  of  the  open-loop  design  plant. 
Model  reduction  is  applied  since  six  poles  are  very  f^«t  relative  to  the  rest  of  the  system.  This 
reduced  order  model  is  used  to  design  new  regulators  in  subsequent  Chapters.  The  performance 
and  control  effort  criteria  are  presented  that  the  subsequent  regulator  designs  will  be  judged 
by.  As  a  baseline  for  comparison,  the  performance  of  the  current  system  is  analyzed.  Two 
techniques  for  robust  stability  analysis,  the  small  gain  test  and  p-analysis,  are  presented  as 
well.  These  techniques  are  applied  to  the  current  Position  Hold  controller  to  illustrate  the 
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process  as  well  as  explore  the  characteristics  of  the  current  regulator.  The  shortfalls  of  these 
techniques  when  applied  to  real  parametric  uncertainty  are  explored  as  well. 

Chapter  5 

In  this  Chapter,  the  first  new  regulator  is  designed  using  the  Linear  Quadratic  Regulator 
(LQR)  methodology.  This  design  requires  full-state  feedback  uncorrupted  by  sensor  noise.  The 
basic  theory  as  well  as  the  design  process  is  explored.  The  resulting  regulator’s  performance  is 
compared  to  the  current  system.  Robust  stability  analysis  is  performed  using  the  techniques 
developed  in  Chapter  4. 

Chapter  6 

An  H2  optimal  compensator  is  developed  in  this  Chapter.  The  H2  theory  is  presented  as 
well.  This  model-based  compensator  does  not  require  full-state  feedback.  The  H2  regulator 
relies  on  state  estimates  obtained  from  the  noisy  measurements  of  the  system  to  minimize  the 
H2  norm  of  a  transfer  function.  As  before,  the  resulting  regulator’s  performance  is  compared  to 
the  current  Position  Hold.  Robust  stability  analysis  is  performed  using  the  techniques  developed 
in  Chapter  4. 

Chapter  7 

In  this  Chapter,  an  optimal  compensator  is  developed.  Both  the  theory  and  the  design 
process  parallel  the  H2  design  of  Chapter  6.  The  Hoa  regulator  minimizes  the  Hoo  norm  of  a 
transfer  function.  The  H^o  regulator  is  compared  to  both  the  current  system  2uid  the  H2  system 
to  help  highlight  the  difference  in  the  choice  of  norms.  Robust  stability  is  explored  as  before. 
In  an  effort  to  obtain  a  robustly  stable  compensator,  a  different  transfer  function  whose  norm 
is  minimized  is  selected.  In  this  manner,  it  is  possible  to  design  for  robust  stability  directly. 

Chapter  8 

This  Chapter  offers  a  summary  of  the  conclusions  reached  during  the  development  of  this 
thesis  and  recommendations  for  further  work. 
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Chapter  2 


Description  of  the  Shuttle  Remote 
Manipulator  System 


The  Shuttle  Remote  Manipulator  System  (SRMS)  is  an  integral  subsystem  of  the  Payload 
Deployment  and  Retrieval  System  (PDRS)  on  board  the  Space  Shuttle.  The  SRMS  is  used 
primarily  for  the  deployment  of  payloads  from  the  Orbiter  cargo  bay,  as  well  as  retrieving 
payloads  from  orbit  and  stowing  them  in  the  cargo  bay.  Payloads  of  up  to  30,000  kg  mass  with 
dimensions  of  up  to  18.3  meters  in  length  and  4.3  meters  in  diameter  can  be  handled  by  the 
system  from  up  to  a  15  meter  distance  in  space.  Other  applications  supported  by  the  SRMS 
include:  crew  extravehicular  activities  (EVA);  inspection,  servicing  and  repair  of  spacecraft; 
transfer  of  men  and  equipment;  as  well  as  the  on-orbit  assembly  of  the  Space  Station  FVeedom 
(SSF)  [2]. 

The  SRMS  is  operated  by  a  mission  specialist  from  the  aft  port  window  location  of  the  crew 
compartment  in  the  Orbiter.  The  SRMS  operator  uses  a  dedicated  control  system  with  the  aid 
of  direct  viewing  as  well  as  a  closed  circuit  television.  Television  cameras  are  located  in  the 
cargo  bay  as  well  as  being  mounted  on  the  arm  itself. 

SPAR  Aerospace  Limited  in  Toronto  in  conjunction  with  a  team  of  Canadian  companies 
performed  design,  development,  testing,  and  manufacture  of  the  SRMS.  All  of  the  work  was 
performed  under  a  contract  from  the  National  Research  Council  of  Canada  (NRCC)  working 
under  the  auspices  of  NASA  [2]. 

This  Chapter  describes  the  SRMS  system  in  detail  so  that  the  motivation  for  design  choices 
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Figure  2.1.  Shuttle  Remote  Manipulator  System 

can  be  understood.  Section  2.1  details  the  physical  structure  of  the  SRMS.  Physical  limitations 
of  the  arm  are  also  discussed.  Section  2.2  explains  the  five  control  modes  of  the  SRMS.  Of  these 
five  control  modes,  three  are  for  routine  Shuttle  operations  and  the  remaining  two  are  backup 
systems.  Section  2.3  concentrates  on  Position  Hold  and  points  out  its  shortcomings. 


2.1  Physical  Description 

The  SRMS  is  comprised  of  a  six  degree-of-freedom  controlled  anthropomorphic  man-machine 
system.  This  system  includes  an  approximately  15  meter  long  manipulator  arm  attached  via 
a  swing-out  joint  to  the  port  longeron  of  the  Shuttle  Orbiter  cargo  bay,  a  control  and  display 
system,  a  controller  interface  unit  between  the  manipulator  and  the  Orbiter  computers,  as  well 
as  the  closed  circuit  televisions  and  lighting  systems  [3].  A  general  diagram  of  the  SRMS  is 
shown  in  Figure  2.1  [2]. 

The  mechanical  arm  assembly  is  shown  in  its  stowed  position  in  Figure  2.2.  The  arm  is 
comprised  of  six  joints  connected  via  structural  elements  providing  a  hemisphere  of  reach  for  the 
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end  effector.  The  joint  sequence  is  as  follows:  the  shoulder  yaw  and  shoulder  pitch  joints  near 
the  arm  attachment  point;  the  elbow  pitch  joint  at  the  arm  midsection;  and  finally  three  joints, 
wrist  pitch,  yaw  and  roll,  near  the  arm  tip.  Three  dimensional  translation  of  the  end  effector 
is  provided  by  the  shoulder  and  elbow  joints,  whereas  the  three  wrist  joints  are  responsible  for 
its  three  dimensional  orientation  [4].  The  upper  boom  of  the  arm,  which  is  6.4  meters  long, 
connects  the  shoulder  and  elbow  joint.  Next,  the  7  meter  lower  arm  boom  connects  the  elbow 
joint  to  the  wrist  pitch  joint.  The  end  effector  is  attached  to  the  wrist  roll  joint.  The  arm 
booms  are  made  from  a  graphite/epoxy  composite  material.  They  have  a  thin-walled  tubular 
cross  section  with  internal  stiffening  rings  [3]- 

The  electro-mechanical  servos  of  each  joint  are  similar  to  each  other  except  for  the  gear 
trains.  The  gear  trains  vary  in  their  gear  ratios  with  a  maximum  of  1842:1  for  the  shoulder 
joint.  All  gear  trains  are  designed  to  provide  both  forward  and  backward  drive  capabilities. 
Backdrive  occurs  when  a  joint  is  driven  by  an  external  torque  applied  to  the  output  of  the 
gearbox.  Backdrive  is  necessary  to  limit  the  loads  which  can  be  applied  to  each  joint. 

Each  servo  is  comprised  of  a  motor  module  and  a  high  efficiency,  low  speed  epicyclic  gear 
train.  Each  motor  module  has  a  brushless  DC  permanent  magnet  motor,  a  primary  and  backup 
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optical  commutator,  a  tachometer,  and  an  electro-mechanical  brake.  Angle  poeition  feedback 
is  provided  from  each  joint  via  optical  position  encoders  [2]. 

The  end  effector  is  attached  to  the  wrist  roll  joint.  The  end  effector  is  comprised  of  three 
snare  wires  allowing  it  ^o  grapple  a  payload  and  keep  it  rigidly  attached  to  the  Orbiter,  or  to 
deploy  a  payload.  Capture  or  release  of  a  payload  is  controlled  by  the  arm  operator  who  can 
view  the  grapple  fixture  through  a  wrist  camera  [5]. 

2.2  SRMS  Control  Modes 

There  are  five  control  modes  of  operation  for  the  SRMS;  three  standard  and  two  back-up 
modes.  The  purpose  of  the  SRMS  control  algorithms  is  to  convert  various  inputs  from  several 
possible  sources  into  rate  commands  for  the  six  joint  servos.  The  servo  rates  are  the  only 
control  input  of  the  system.  Currently  the  only  feedback  to  the  control  algorithms,  besides  the 
operator  in  the  loop,  are  the  joint  angles  from  the  high  precision  poeition  encoders  in  each  joint. 
Typically  the  operator  desires  to  control  the  end  effector  or  some  point  of  resolution  (FOR) 
on  the  payload;  either  rotationally,  translationally,  or  a  combination  of  both.  Inputs  to  the 
control  algorithms  could  come  from  either  hand  controller  inputs  made  by  the  arm  operator, 
or  an  end  effector  auto  sequence  trajectory  stored  in  the  Orbiter’s  general  purpose  computers 
(CPC)  before  flight  [2], 

The  five  modes  of  operation  of  the  SRMS  are: 

1.  Manual  Augmented  Mode 

2.  Automatic  Mode 

•  operator  commanded 

•  preprogrammed  auto  sequence 

3.  Single  Joint  Drive  Mode 

4.  Direct  Drive  Mode 

5.  Backup  Drive  Mode 

The  Manual  Augmented  Mode  includes  the  operator  in  the  control  loop;  this  is  the  normal 
mode  of  operation  of  the  arm.  The  operator  commands  the  end  effector  (or  FOR  of  the  payload) 
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translational  and  rotational  velocity  using  the  two,  three-degree-of-freedom  hand  controllers. 
Commands  from  the  hand  controllers  are  resolved  into  joint  rate  conunands  by  the  SRMS 
computers.  The  operator  has  a  choice  of  four  different  coordinate  operating  systems  [2].  In  this 
mode  of  operation,  all  joints  can  be  commanded  simultaneously. 

Two  types  of  Automatic  Modes  are  available  in  the  SRMS.  In  the  operator  commanded 
auto  sequence,  the  arm  is  driven  along  a  ‘straight  line’  to  coordinates  input  to  the  GPC  by 
the  operator  during  flight.  During  a  preprogrammed  auto  sequence,  the  operator  can  select 
up  to  four  of  20  preprogrammed  trajectories  to  execute  at  one  time.  These  preprogrammed 
trajectories,  which  are  verified  before  flight,  are  composed  of  up  to  200  straight  line  elements 
stored  in  the  GPC.  The  operator  can  select  any  point  along  the  straight  line  elements  as  a 
pausing  point  for  the  arm  until  he  or  she  requests  it  to  complete  the  trajectory  [3]. 

In  the  Single  Joint  Drive  Mode  the  operator  can  control  an  individual  joint  with  the  full 
support  of  the  GPC.  The  operator  selects  the  joint  and  provides  a  fixed  drive  signal.  The  GPC 
then  interprets  the  appropriate  rate  commands.  The  remaining  joints  are  kept  in  a  Position 
Hold  status  as  described  in  Section  2.3  [3]. 

Direct  Drive  Mode  is  the  first  contingency  mode  of  the  SRMS.  As  in  the  Single  Joint  Mode, 
the  operator  may  command  only  one  joint  at  a  time.  In  this  mode,  the  remaining  joints  are 
deactivated  and  held  in  position  by  their  brakes.  Commands  to  the  working  joint  are  hardwired 
to  the  Motor  Drive  Amplifiers  (MDA).  Commands  bypass  the  GPC,  data  buses,  and  the  servo 
control  loop  [2]. 

The  Backup  Drive  Mode  is  an  additional  contingency  mode  provided  to  help  insure  the 
SRMS  is  fail-safe.  Similar  to  the  Direct  Drive  Mode,  only  one  joint  may  be  controlled  at  a 
time.  It  is  used  when  no  primary  drive  channels  are  available.  It  bypasses  all  primary  channel 
electronics,  including  the  GPC,  and  uses  a  separate  Backup  Drive  Amplifier  (BDA)  and  backup 
motor  commutator  driven  by  a  separate  power  source  [3]. 

2.3  Position  Hold 

After  completion  of  an  SRMS  maneuver  in  one  of  the  three  normal  modes  of  operation, 
each  control  servo  is  given  a  zero  rate  command  for  a  short  period  of  time  (determined  by 
the  payload/arm  configuration).  After  the  zero  rate  command,  each  servo  drops  into  Position 
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Figure  2.3.  Typical  SRMS  Maneuver  Profile 

Hold  Mode.  This  zero  rate  command  period  is  the  window  in  which  the  controllers  designed 
by  Gilbert,  et  al.[l]  reside.  It  is  designed  to  damp  end  effector  velocities  as  quickly  as  possible. 
A  typical  SRMS  maneuver  time  profile  is  shown  in  Figure  2.3. 

As  the  name  implies,  the  purpose  of  the  Position  Hold  Mode  is  too  keep  the  arm  in  a  fixed 
position.  This  mode  is  employed  after  a  maneuver  to  give  the  SRMS  time  to  settle  and  maintain 
a  fixed  position  and  attitude.  At  this  point  the  operator  either  moves  on  to  the  next  maneuver, 
or  has  reached  the  desired  final  position.  This  is  actually  a  subtle  point;  between  maneuvers  of  a 
given  series  of  maneuvers  the  operator  is  not  necessarily  concerned  with  a  particular  end  effector 
position  and  attitude,  rather  he  or  she  would  like  all  oscillations  to  stop  before  moving  on  to 
the  next  maneuver.  However,  after  the  final  maneuver  of  a  sequence  the  end  effector  position 
and  attitude  is  the  principal  concern,  especially  in  operations  such  as  SSF  assembly.  For  this 
reason,  linear  controller  designs  presented  subsequently  will  regulate  end  effector  position  and 
attitude  rather  than  its  velocity. 

During  Position  Hold  Mode,  joint  angle  positions  measured  via  the  joint  angle  encoders 
are  fed  back  to  the  servo  control  loops.  It  is  important  to  realize  that  the  joint  angles  are 
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converted  into  joint  rate  commands  via  proportional  feedback  only.  With  only  the  joint  angles 
provided  to  the  servo  control  loops,  no  information  about  flexibility  in  the  manipulator  booms 
is  available.  Currently  the  operator  must  wait  for  the  arm  structure  to  damp  excitations  in  the 
booms  resulting  from  residual  energies  after  a  maneuver.  Furthermore,  while  the  SRMS  is  in 
Position  Hold,  current  guidelines  allow  Shuttle  attitude  reaction  control  system  (RCS)  firings. 
In  such  a  situation,  Position  Hold  should  maintain  the  end  effector  in  its  current  position  and 
attitude  relative  to  an  Orbiter-fixed  reference  frame.  However,  Shuttle  attitude  jet  firings  excite 
considerable  flex  in  the  booms  of  the  SRMS.  Furthermore,  when  heavy  payloads  (>10K  lbs) 
are  deployed  on  the  SRMS,  this  RCS  induced  flexure  can  result  in  closed-loop  Shuttle  flight 
control  system  instabilities. 

If  it  is  desired  to  reduce  the  fraction  of  mission  time  squandered  waiting  for  the  SRMS  to 
settle  and  to  reduce  dynamic  interaction  with  other  control  systems,  the  benefits  of  focusing 
on  improving  Position  Hold  Mode  are  apparent.  Position  Hold  Mode  is  universal  to  the  three 
regular  control  modes;  therefore,  improving  its  performance  improves  the  SRMS  for  its  full 
range  of  normal  operation. 
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Chapter  3 


Modelling  the  SRMS 

Modelling  is  a  delicate  task  of  representing  the  dynamics  of  the  Orbiter/SRMS/Fayload 
system  which  are  important  for  the  envelope  of  design  while  ignoring  dynamics  which  are  out¬ 
side  the  envelope.  We  are  interested  in  improving  SRMS  performance  while  it  is  operating  in  its 
Position  Hold  Mode.  Position  Hold  Mode  is  a  regulator;  one  that  desires  to  subdue  initial  con¬ 
ditions  (residual  energies  from  arm  maneuvers),  and  reject  disturbances  (introduced  by  Shuttle 
attitude  jet  firings).  In  this  mode,  the  arm  is  essentially  in  a  fixed  position;  therefore,  chang¬ 
ing  mass  and  inertia  matrices  associated  with  changing  arm  geometries  are  not  a  significant 
factor.  However,  the  six  electro-mechanical  control  servos  at  the  joints  do  contain  substantial 
nonlinearities  in  their  dynamics  which  cannot  be  ignored. 

The  approach  taken  to  modelling  the  SRMS  is  to  divide  the  system  into  two  subsystems. 
First,  the  dynamics  of  the  Orbiter/SRMS/Payload  (arm)  subsystem  are  considered  without  the 
dynamics  of  the  servos.  Second,  the  control  servo  dynamics  are  modelled.  A  block  diagram 
depicts  the  total  system  in  Figure  3.1. 

Inputs  to  the  ‘arm  dynamics’  sub»>ystem  include  Shuttle  attitude  thruster  firings,  sensor 
noise,  as  well  as  the  torques  on  e^ch  joint  generated  by  their  corresponding  control  servo. 
These  inputs  are  combined  to  determine  the  end  effector  acceleration.  This  acceleration  is  then 
integrated  to  obtain  the  outputs  of  this  subsystem  which  are  end  effector  position  and  attitude 
as  well  as  the  six  joint  angles.  The  joint  angles  are  fed  back  through  a  proportional  feedback 
gain  matrix  to  the  control  servos.  If  we  consider  the  joint  angle  and  end  effector  perturbations 
to  be  small,  which  is  the  case  in  Position  Hold,  this  subsystem  is  essentially  linear. 

The  control  servo  subsystem  consists  of  six  servos  which  have  as  inputs  joint  rate  com- 
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Figure  3.1.  Block  Diagram  of  the  Compensated  System 

mands  from  a  designed  controller  as  well  as  the  joint  angles  determined  by  the  arm  dynamics. 
This  subsystem  has  as  outputs  the  torques  generated  on  each  joint.  There  are  several  nonlin¬ 
earities  present  in  each  servo,  including:  current  and  torque  limiters,  trim  integrator  limiter, 
gearbox  stiffness,  digital  and  analog  tachometers,  as  well  as  joint  motor  stiction/frictions.  The 
approach  taken  to  linearize  this  nonlinear  subsystem  is  to  treat  the  nonlinearities  as  modelling 
uncertainties  on  a  linear  plant. 

Section  3.1  generates  the  equations  of  motion  for  the  Orbiter/SRMS/Payload  system.  Equa¬ 
tions  are  generated  in  an  Orbiter-fixed  reference  frame.  The  equations  of  motion  are  derived  for 
a  simplified  one  joint  example  and  then  extended  to  six  joints.  Assumptions  used  in  modelling 
are  also  outlined. 

Section  3-2  begins  with  a  nonlinear  model  of  the  joint  servos  and  obtains  a  simplified  linear 
model.  This  linear  model  is  then  combined  with  the  arm  plant  in  order  to  generate  a  linearized 
model  of  the  complete  SRMS.  In  order  to  linearize  the  control  servos,  certain  nonlinearities 
are  ignored  which  have  minimal  contribution  in  Position  Hold  Mode.  The  remaining  nonlin¬ 
earities  are  linearized  by  selecting  ‘nominal’  values,  but  their  effects  are  modelled  as  parameter 
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uncertainties  on  the  linear  plant.  The  standard  three  block  form  of  a  transfer  function  used  for 
design  and  robust  analysis  is  introduced  as  well.  The  limitations  of  the  methods  used  in  this 
Section  are  also  discussed. 

The  resulting  24  state  linearized  model  is  compared  with  a  higher  fidelity  nonlinear  simu¬ 
lation  in  Section  3-3.  This  nonlinear  simulation  has  been  compared  with  actual  flight  data  to 
confirm  its  accuracy.  Comparisons  are  made  in  the  time  domain  in  Position  Hold  Mode.  The 
effects  of  varying  the  parameter  uncertainties  of  the  linear  plant  in  an  effort  to  capture  the 
range  of  the  nonlinear  behavior  are  also  investigated.  Section  3.4  draws  the  conclusions  of  this 
Chapter.  The  deficiencies  of  the  major  assumptions  used  in  this  Chapter  are  also  re-explored. 


3.1  Equations  of  Motion  for  the  Orbiter/SRMS/Payload  Sys¬ 
tem 

Much  of  the  derivation  of  the  equations  of  motion  in  this  section  is  based  on  the  SRMS 
simulator  LSAD’ .  LSAD  is  a  higher  fidelity  nonlinear  simulation  of  the  SRMS  and  flight  control 
system  of  the  Space  Shuttle.  Details  on  LSAD  and  the  basis  of  this  section  can  be  found  in 
References  [7],  [8],  and  [9]. 

In  deriving  the  equations  of  motion,  the  mass  of  the  connecting  structure  between  the  Or- 
biter  and  payload  can  be  ignored  when  a  payload  mass  exceeds  5,000  lbs.  Previous  simulations 
have  found  this  assumption  to  be  reasonable.  LSAD  as  well  is  based  on  this  assumption.  Ref¬ 
erence  [7]  has  the  detailed  derivation  of  the  rotational  dynamics  between  two  bodies  connected 
via  a  massless,  flexible  structure.  Rather  than  derive  the  six  joint  case.  Section  3-1.1  examines 
the  one  joint  case  and  Section  3.1.2  extends  it  to  six  joints  based  on  the  work  in  Reference  [7]. 

3.1.1  Equations  of  Motion  for  the  Single  Joint  Case 

Figure  3.2  represents  the  case  of  a  single  joint  in  Position  Hold.  The  SRMS  has  been  recast 
as  a  point  mass,  m,  oscillating  on  the  end  of  a  massless  flexible  beam  which  is  attached  to  a 
single  joint.  A  joint  in  Position  Hold  behaves  as  aspring  with  some  damping.  Table  3.1  defines 
the  essential  variables. 

'I, SAT)  stands  for  I.ess  Singing  And  Dancing;  a  take  off  of  Spar  Aerosparr's  ASAD:  All  Singing  And  Dancing. 
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Table  3.1.  Variable  Notation  for  One  Joint  EOM 


Variable 

Description 

P 

force  acting  on  the  beam 

torque  acting  at  the  joint 

w 

displacement  of  the  beam  tip 

Wf 

deflection  of  the  beam  tip  due  to  flexure 

Wj 

deflection  of  the  beam  tip  due  to  joint  angle 

a 

bending  flexibility  of  the  beam 

7j 

joint  angle 

L 

length  of  the  beam 

From  these  definitions 


w 

—  my  4-  Wj 

(3.1) 

WJ 

II 

(3.2) 

Wf 

=  ap 

(3.3) 

=  Lp 

(3.4) 

Therefore 

w  =  Ljj  +  ap  =  L-fj  +  a-j-  (3.5) 

From  Newton’s  second  law,  in  the  absence  of  external  forces,  the  force  at  the  tip,  or  the 
torque  over  the  length,  is  equal  to  the  acceleration  of  the  mass 


p  =  —mw 

When  combined  with  Equation  3.4,  this  can  be  recast  as 


(3.6) 


w  =  — 


1 


(3.7) 


Combining  Equation  3.7  with  3.5  obtains 


+  ”]  <*■*> 
Equations  3.7  and  3.8  together  achieve  the  objectives  of  determining  the  end  effector  posi¬ 
tion,  w,  (position  only  for  the  one  joint  case)  and  the  joint  angle  position,  jj,  given  a  torque 
input  to  the  system,  Tj.  These  two  equations  can  be  rewritten  as 


w  =  —8~^m~^L  ’r; 


7j 


=  —  8  ^ L~^  m~^  L~^  +  L  Tj 


(3.9) 

(3.10) 


Though  this  representation  seems  notationally  verbose,  it  will  help  motivate  the  six  joint 
case.  The  extension  of  this  problem  to  six  joints  involves  more  complex  notation,  but  essentially 
the  same  concepts. 
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3.1.2  Equations  of  Motion  for  the  Six  Joint  Case 


In  extending  this  problem  to  six  joints  and  three  dimensions,  it  is  necessary  to  introduce 
vector  and  matrix  notation  to  keep  track  of  positions  and  rotations.  Furthermore,  we  will  add 
the  effects  of  external  forces  caused  by  Shuttle  attitude  thruster  firings  to  the  system. 

As  before,  ID,  will  define  the  state  of  the  end  effector  in  an  Orbiter-fixed  reference  frame. 
Now,  however,  in  three  dimensions  it  is  composed  of  a  displacement  vector  and  a  rotation  vector 

(3.11) 

where  q  is  the  vector  defining  the  translational  deflection  of  the  end  effector  due  to  the  applied 
load.  Similarly,  6  is  the  vector  defining  the  angular  deflection  of  the  end  effector  on  each  axis 
due  to  the  applied  load.  Since  Position  Hold  Mode  entails  small  variations  about  equilibrium, 
rotations  can  be  described  by  vectors.  Table  3.2  makes  similar  definitions  as  before. 


Table  3.2.  Variable  Notation  for  Six  Joint  EOM 


Variable 

Description 

P 

applied  force  from  the  payload  at  the  end  effector 

T 

applied  moment  from  the  payload  at  the  end  effector 

^7 

vector  of  servo  torques  at  each  joint 

XV 

end  effector  state  vector 

Wf 

end  effector  deflection  due  to  flexibility  of  the  arm 

Wj 

end  effector  deflection  due  to  rotations  of  the  joints 

A 

6  by  6  flexibility  matrix 

T 

joint  angle  vector 

L 

6  by  6  Jacobian  matrix 

As  before,  xv  may  be  decomposed  as  follows 


W  =  Wj  +  Wf 


(3.12) 
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where 


wj  =  L-j 


(3.13) 


&nd 


Wf  =  A 


The  six  joint  analog  of  Equation  3.4  is 


r,  =  I,-'  [’' 

[r 


(3.14) 


(3.15) 


where  is  a  vector  of  the  servo  torques  at  each  joint.  This  expression  can  be  inverted  as 
follows 


L  ^  Ty  = 


Combining  Equation  3.12  with  3. 16,  3.13,  and  3.14  gives 


(3.16) 


w  =  +  AL~ '  Ty 

The  six  joint  version  of  Equation  3.6  with  the  effects  of  external  forces  added  is 


—s^Mw  +  u/> 


(3.17) 


(3.18) 


where  M  is  the  system  mass  matrix  and  u/>  is  the  vector  representing  the  effect  of  control 
torques  from  Shuttle  attitude  thruster  firings.  Combining  Equations  3.16  and  3.18  obtains 


w  = —8  ’L  '  Ty+  (3.19) 

Placing  Equation  3.19  into  Equation  3. 17  results  in 

7  =  -  +  (3.20) 

Equations  3.19  and  3.20  are  the  six  joint  analogs  of  Ek]uations  3.9  and  3.10.  These  together 
define  the  equations  of  motion  for  the  Orbiter/SRMS/Payload  subsystem.  Given  the  torques 
produced  by  the  control  servos,  and  disturbances  caused  by  Shuttle  attitude  thruster  firings, 
the  state  vector  defining  the  end  effector  position  and  attitude  as  well  as  the  joint  angle  vector 
are  determined  by  the  above  system  of  equations. 
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3.2  Modelling  and  Linearization  of  the  Control  Servos 


The  general  approach  of  this  Section  is  to  characterize  the  transfer  function  between  the  joint 
angles,  7,  and  the  joint  rate  commands,  7^,  to  the  torque  output  at  the  joints,  Ty.  Section  3.2.1 
linearizes  this  nonlinear  system,  while  Section  3.2.2  characterizes  the  effect  of  significant  non- 
linearities. 

3.2.1  Control  Servo  Dynamics 

The  highest  fidelity  model  of  the  SRMS  servos  available  at  Draper  Laboratory  is  shown  in 
Figure  3.3.  This  model  was  designed  to  be  accurate  over  a  large  frequency  range  and  a  wide 
envelope  of  operations.  This  model  is  faithful  enough  to  even  encompass  the  accelerations  of 
the  motor  shaft  while  it  transitions  from  forward  driving  to  back  driving^  using  simulation  time 
steps  of  2  milliseconds  [8].  References  [8]  and  [10]  show  that  for  the  low  frequency  oscillations 
that  occur  with  heavy  payloads,  much  of  Figure  3.3  can  be  eliminated. 

References  [8]  and  [10]  both  derive  linearized  models  of  the  servo,  but  for  slightly  different 
purposes.  They  do,  however,  provide  useful  insight  into  the  nature  of  the  nonlinearities  as  well 
as  the  conditions  in  which  they  can  be  ignored.  Figure  3.4  represents  the  linearized  model  of 
the  servos.  Table  3.3  defines  the  variables  of  Figure  3.4.  All  quantities  are  either  vectors  or 
matrices  since  there  are  six  servos.  The  servos  are  all  similar  in  configuration;  differences  occur 
only  in  the  values  of  the  gear  ratios,  current  limiters,  moments  of  inertias,  and  gearbox  stiffness. 

Heavy  payloads  are  assumed  in  modelling  the  arm  dynamics.  Draper’s  experience  with  the 
SRMS  in  Position  Hold  mode  demonstrates  these  conditions  imply  low  frequency  motions  with 
moderate  amplitudes.  This  implies  we  can  neglect  high  frequency  dynamics.  In  linearization, 
the  effects  of  gear  backlash,  stictions,  and  frictions  are  ignored  since  they  have  negligible  effect. 
Simulations  have  shown  that  the  integral  trimmer  has  an  effect  only  on  motion  when  friction 
is  important;  therefore,  the  trimmer  and  its  limiter  are  eliminated  from  the  linear  model.  The 
analog  tachometer  with  its  high  pass  filter  has  negligible  effects  on  low  frequency  motions,  so  it  is 
eliminated  as  well  [8].  The  digital  tachometer’s  quantization  effects  are  ignored;  it  is  treated  as 
a  linear  gain.  While  in  Position  Hold,  the  current  limiter  saturates  before  the  torque  limiter  so 
the  torque  limiter  can  be  eliminated  from  the  model.  Indeed,  the  most  significant  nonlinearities 

’Forward  drive  ocrnrs  when  the  motor  in  driving  the  payload,  nark  drive  ornim  when  the  p^’load  in  driving  the 
motor. 
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Figure  3.3.  Draper  Servo  Model 
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Table  3.3.  Variable  Notation  for  Servo  Parameters 


Variable 

Description 

Units 

7 

Joint  Angle  Vector 

radians 

^7 

Vector  of  Torque  at  each  Joint 

ft-lb 

Joint  Rate  Commands 

rad/sec 

Motor  Shaft  Angle  Vector 

radians 

t 

Motor  Shaft  Rate  Vector 

rad/sec 

N 

Gear  Ratio 

- 

Motor  Speed  Command  Scaling  Gain 

count/rad/sec 

f^DA 

Digital-to-Analog  Converter  Gain 

v/cnt 

Ka 

Motor  Drive  Amplifier  Gain 

v/v 

KrIRi. 

Motor  Torquing  Gain 

ft-lb/ V 

Kn 

Effective  Gain  of  Digital  Tachometer  Processing 

cnt/rad/sec 

Kh 

Motor  Back  EMF  Constant 

v/rad/sec 

Rma 

Gain  of  the  Forward  Path:  KoaKaK-i  /Ri. 

ft-lb /cnt 

Kh'H 

Gain  of  the  Feedback  Path:  Ka  +  KhIKdaKa 

cnt/rad/sec 

Jm 

Motor  Rotor  and  Gearbox  Input  Shaft  Moment  of  Inertia 

slug-ft* 

Ka 

Gearbox  Stiffness 

ft-lb/rad 
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Figure  3.4.  Linearized  Servo  Model 

while  operating  in  Position  Hold  with  a  heavy  payload  are  the  motor  current  limiter  and  the 
gearbox  stiffness  curve.  These  cannot  be  dismissed  as  easily  as  the  other  nonlinearities.  Ignoring 
their  effects  could  lead  to  controller  designs  which  do  not  achieve  robust  performance  or  stability. 

In  an  attempt  to  capture  the  effect  of  these  nonlinearities,  uncertainty  blocks  are  used. 
Combining  the  gains  in  the  forward  path  of  the  servo  leads  to  a  single  gain,  Km  a-  This  gain 
includes  the  effect  of  the  current  limiter.  Since  it  has  the  potential  to  saturate  via  its  limiter, 
it  is  represented  in  Figure  3.4  with  an  uncertainty  block.  A,  around  it.  Gearbox  stiffness,  Ka, 
with  it’s  nonlinear  behavior  has  a  A  block  around  it  as  well.  The  mathematical  characterization 
of  these  A  blocks  is  discussed  in  the  next  Section. 

Figure  3.5  combines  the  results  of  the  servo  dynamics  and  the  arm  dynamics  in  block 
diagram  form.  This  linearized  version  of  the  SRMS  has  been  dubbed  SSAD’^.  If  not  for  the  A 
blocks,  it  would  be  short  work  to  put  this  linear  plant  model  into  state-space  form^ 

Xp  *  ApXp  "h  BpU 

y  =  CpXp  +  DpU  (3.21) 

where  z  is  the  plant  state  vector,  y  is  the  observation  vector,  and  u  is  the  control  vector. 
Equation  3.21  represents  the  nominal  state-space  form  of  SSAD. 

For  the  development  of  regulators  in  subsequent  Chapters,  measurements  of  the  end  effector 
position  and  attitude  for  feedback  are  assumed.  This  selection  of  measurements  provides  the 
necessary  observability  of  the  structural  flexibility  existing  in  both  the  SRMS  booms  and  the 
•'SSAD  irt.anrtK  for  Some  Singing  And  Dancing. 

‘From  now  on.  bam  are  left  off  verfom  nince  all  quantities  are  vertom  or  matrices  unless  specified  otherwise. 
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Figure  3.5.  Linearized  SRMS  Model 
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gearboxes  at  each  joint.  There  are  several  types  of  sensor  suites  that  can  recover  this  type  of 
information,  including  force-torque  sensors,  strain  gauges,  or  accelerometers.  Eventually,  the 
NASA/Johnson  Space  Center  (JSC)  plans  to  incorporate  one  of  these  set-ups  into  the  SRMS 
system. 

3.2.2  Representing  the  Uncertainties 

The  method  of  characterizing  parametric  uncertainty  presented  in  this  Section  follows  the 
work  of  Reference  [11].  As  shown  in  Figure  3.5,  the  state-space  model  of  SSAD  is  a  function 
of  several  physical  parameters,  two  of  which,  Ka  &nd  Km  a,  ^te  uncertain.  Using  the  form  of 
Equation  3.21  it  is  possible  to  model  these  parameter  errors  as 

Cp-t-E  +  E  “  (3-22) 

parameter  error  that  has  been  normalized  as 
-l<6i<l  Vi 

As  this  suggests,  the  nominal  value  of  both  Ka  and  Km  a  must  be  bounded  equally  from 
both  above  and  below.  Consider  Ka  first. 

The  gearbox  stiffness  curve  is  shown  in  Figure  3.6.  The  gain,  Ka,  is  the  slope  of  the  curve 
at  the  point  ~  1-  This  slope  can  achieve  values  from  zero  up  to  a  maximum  slope  that  is 
determined  by  the  servo  under  consideration.  One  choice  that  captures  all  possible  values  of 
Ka  is  to  choose  the  nominal  value  as  half  the  maximum  attainable  slope  with  bounds  above  and 
below  of  also  half  the  maximum  slope.  This,  however,  will  introduce  problems  when  performing 
robust  analysis.  Allowing  a  zero  value  for  Ac;  is  essentially  allowing  for  the  possibility  of  the 
servos  to  be  cut  off  from  the  rest  of  the  system.  All  of  the  servos’  associated  dynamics  would 
become  both  uncontrollable  and  unobservable  to  any  controller.  Such  a  system  will  always  fail 
robust  stability  analysis.  This,  however,  is  a  mathematical  fiction.  The  system’s  manifestation 
in  a  zero  Ka  state  is  transient,  not  a  situation  which  the  controller  must  deal  with  for  any 
length  of  time.  A  gain  of  0.01  is  used  as  the  minimum  value  of  Ka  in  order  to  pose  the  bounds 


y  = 

where  each  6;  represents 
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Figure  3.6.  Gearbox  Stiffness  Curve 


in  a  better  conditioned  manner.  For  a  value  of  Ka  below  0.01  we  are  assuming  that  the  system 
is  either  in  a  state  of  transition  or  in  a  relatively  unexcited  state.  is  selected  in  the 

center  of  this  range. 

The  current  limiter,  shown  in  Figure  3.7,  is  a  unit  gain  until  it  saturates.  After  saturation, 
the  limiter’s  behavior  can  be  characterized  as  a  gain  that  is  inversely  proportional  to  the  input 
current.  The  minimum  gain  achievable,  K^int  the  current  limit  value  divided  by  the  maxi¬ 
mum  current  possible  in  the  system,  both  known  quantities  for  each  servo.  Since  the  current 
limiter  has  been  absorbed  into  Km  a.  Km  a  will  have  a  range  from  KminKMA  up  to  Kma- 
KMAnom  selected  in  the  center  of  this  range. 

There  are  now  twelve  parameters,  six  Ka  and  six  Kmaj  nominally  centered  in  ranges,  but 
free  to  vary  independently  of  each  other.  To  calculate  every  possible  AS,,  ACi  and  ADi 
these  parameters  would  generate  through  their  ranges  would  be  computationally  intensive.  As 
a  first  order  approximation,  AA,,  ABi,  ACi  &ud  ADi  ufc  computed  by  sending  each  parameter 
individually  to  its  upper  and  lower  bound  to  reduce  computation.  This  will  generate  a  set  of 
24  perturbed  matrices. 

The  system  can  now  be  written  in  the  perturbed  state-space  form  of  Equation  3.22.  The 
perturbation  matrices  associated  with  each  uncertain  parameter  can  be  combined  into  a  single 
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Figure  3.7.  Current  Limiter 


(3.23) 


matrix 

^  AAi  AS. 

ACi  AD, 

where  n,  =  dim(z),  riy  =  dim(j/),  and  n»  =  dim(ti).  This  matrix  is  not  full  rank  for  the  SSAD 
system  since  any  one  parameter  does  not  affect  all  of  the  states  and  outputs.  Ni  can  therefore 
be  decomposed  as 

r 

Q. 


N,= 


Ri 


Si  T, 


(3.24) 


where  for  Rank(;V,)  =  n„  Q,  €  Ri  Si  G  T,  6  The 

state-space  model  of  the  perturbed  system  can  be  rewritten  as 


Ap  +  T,  Q>^iJn,Si 

i=1 


=  ApXp  +  [Qi  ...  Q*] 


y  = 


Cp  +  Y,Ri6iIn,Si 

i=1 


Xj,+ 

. 

Xp  + 


k 

I 

i=1 


Bp  +  ^Q,SiIn,Ti 


+  BpU 


Dp  +  Y,Ri^^^n,Ti 

i=1 


(3.25) 
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=  CpXp  +  [iii  . . .  i2t] 


»/* 


+  Dpu 


(3.26) 


'  5,  ■ 

* 

= 

• 

Xp  + 

• 

(k 

Tt 

(3.27) 


The  Ci  represent  the  inputs  to  the  uncertainty  block,  A.  They  are  a  function  of  Xp  and  u. 
The  r\i  represent  the  outputs  of  the  uncertainty  block  which  will  influence  the  state  dynamics. 
This  models  the  effect  the  parameter  uncertainties  have  on  the  plant  dynamics.  Define 


Q  =  [Q^■■  Qk] 

R  =  [/?,...  R,] 

S  =  [Sr...S’l) 

T  =  [T/’ ...  r*!’] 

Augmenting  the  unperturbed  state-space  of  Equation  3.21  with  process  noise,  u;,  measure¬ 
ment  noise,  v,  and  the  effects  of  parameter  uncertainty  as  defined  above  gives 


Xp  =  ApXp  +  BpU  -I-  B,„w  -f-  Qt] 
y  =  CpXp  +  DpU  +  D^v  +  Rf) 

e  =  Sxp  +  Tu  (3.28) 


Figure  3.8  represents  the  general  transfer  function  of  a  nominal  plant,  P(s),  and  compen¬ 
sator,  K(s),  with  parametric  uncertainty,  A.  In  this  representation,  d  represents  all  exogenous 
inputs  into  the  system;  in  this  case 


d  = 


w 


(3.29) 


and  e  is  an  error  vector  of  signals  of  interest  to  be  defined  later.  The  state-space  representation 
of  Figure  3.8  is 
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Figure  3.8.  Standard  Form  for  a  Compensated  Plant  with  Uncertainty 


X  =  Ax  Bj  Tf  -j-  B2d  +  B^ u 
(  —  C-^x Diit)  D^2d D\^u 

c  =  C2*  +  D2\t}  +  Djjd  +  D2^u 

y  =  Cn*  +  022*7  "h  0.i2<^  +  D^:\u 

Placing  Equation  3.28  in  the  form  of  Equation  3.30  gives 


'p 

Ap 

Q 

B2 

5p  ■ 

f 

S 

0 

0 

r 

*7 

e 

C2 

O21 

O22 

O2.1 

d 

y 

R 

0.12 

Dp  . 

u 

(3.30) 


(3.31) 


O2  —  [S«.  :  0(„^x„^)] 

0.12  =  [0(„,xn,„)  :  O^] 

Equation  3.31  represents  the  state-space  equation  for  the  linearized  SRMS  with  the  nonlin¬ 
earities  Km  A  and  Ka  modeled  as  parameter  uncertainties.  One  clarification  needs  to  be  made 
about  the  matrix  B2-  This  matrix  characterizes  how  the  process  noise  affects  the  state.  This 
matrix  comes  from  the  u/;  term  of  equation  3.19.  u/;  represents  the  effect  of  Shuttle  attitude 
thruster  firings  on  the  end  effector.  Recall  that  while  in  Position  Hold  mode  the  Shuttle  is 
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restricted  from  using  its  translational  thrusters,  therefore  u/>  will  effect  only  the  rotational 
states.  However,  for  computation  of  a  linear  optimal  estimator  such  as  one  employed  by  model 
based  compensators,  every  state  must  be  controllable  by  the  noise.  To  avoid  problems  before 
they  occur,  the  effects  of  fictitious  Shuttle  translational  thruster  firings  are  added  to  82- 

The  actual  construction  of  u/>  depends  upon  the  Orbiter /Pay load  configuration.  On  a  given 
Shuttle  mission,  pre-flight  evaluations  select  a  certain  set  of  jets  to  fire  for  a  roll,  pitch  or  yaw 
command  for  a  given  Orbiter/Payload  configuration.  Barring  a  system  failure,  those  same  jets 
are  used  every  time.  Since  the  force  a  particular  jet  will  impart  to  the  end  effector  can  be 
calculated,  the  disturbance  vector,  ud,  has  six  elements  corresponding  to  a  roll,  pitch,  yaw, 
X  translation,  y  translation,  or  z  translation  command.  Such  a  command  is  normalized  to  lie 
between  -1  and  1  corresponding  to,  for  example,  a  -roll  command  or  a  -broil  command. 

3.3  Comparison  of  the  Linear  and  Nonlinear  Plant 

There  are  several  simplifications  made  in  developing  the  linearized  SSAD  from  the  nonlinear 
LSAD.  This  Section  compares  the  two  simulations  to  expose  the  deficiencies  in  the  linearization 
process.  Frequency  domain  information  such  as  singular  value  decompositions  are  not  available 
for  the  LSAD.  Instead,  comparisons  are  made  in  the  time  domain  while  running  both  systems 
in  Position  Hold  Mode.  Figure  3.9  shows  a  block  diagram  of  SSAD  in  Position  Hold  Mode. 
The  Position  Hold  controller  does  nothing  more  than  feed  back  the  joint  angles,  7,  to  the  servos 
through  a  set  of  gains,  Ky . 

The  Hubble  Space  Telescope  (HST)  in  its  release  configuration  is  selected  as  the  payload  on 
a  standard  Orbiter.  HST  weighs  25,000  lbs,  so  the  heavy  payload  assumption  is  not  violated. 
HST  in  its  release  configuration  will  be  used  in  subsequent  Chapters  for  all  designs  since  when 
this  research  began,  ample  data  was  available  on  this  payload  in  this  configuration. 

Recall  that  both  A  and  K(;  have  a  range  of  values.  The  nominal  state-space,  Pnom(*)i 
places  both  Km  a  and  Ka  at  the  midpoint  of  their  respective  ranges.  Since  Pnom(s)  is  the 
plant  that  will  be  used  in  controller  design,  Pnom(s)  in  Position  Hold  Mode  is  compared  with 
LSAD. 

Both  SSAD  and  LSAD  are  run  in  a  Position  Hold  closed-loop  with  initial  conditions  on 
end  effector  velocity,  w.  This  scenario  models  the  situation  occuring  when  the  SRMS  drops 
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8«rvo  dynamics  area  dynamics 


commands 


Figure  3.9.  SSAD  in  Position  Hold 

into  Position  Hold  Mode  following  a  maneuver;  the  SRMS  is  instructed  to  maintain  it’s  current 
position  and  attitude  though  residual  velocities  remain  from  the  maneuver.  Recall  that  the  end 
effector  state  vector  is  constructed  as  follows 


(3.32) 


(■  X  position  ' 

Y  position 
q  Z  position 

6  Roll  attitude 

Pitch  attitude 
L  Yaw  attitude  . 

Figures  3.10  and  3.11  compare  the  translational  and  rotational  response  of  the  end  effector 
to  an  initial  velocity  for  both  LSAD  and  the  nominal  SSAD.  Both  amplitudes  and  frequencies 
are  similar.  Considering  the  linearization  process,  SSAD  compares  reasonably  well  with  LSAD. 

Recall  that  Figures  3.10  and  311  represent  the  nominal  SSAD  plant.  In  the  nominal  plant, 
both  K \i  4  and  K(;  have  been  placed  in  the  center  of  their  ranges  for  design  and  robust  anal¬ 
ysis  purposes.  Hopefully,  the  uncertainty  ranges  on  these  gains  capture  the  behavior  of  the 
nonlinear  plant.  Following  this  logic,  there  should  exist  off-nominal  values  for  both  Km  a  and 
K(}  that  will  make  the  comparison  between  LSAD  and  SSAD  more  favorable  for  these  initial 
conditions.  Looking  at  Figures  3.10  and  3.11  it  seems  that  SSAD  is  damping  quicker  than 
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Figure  3.10.  Translation  Response  of  LSAD  and  Nominal  SSAD  to  an  Initial  Velocity 
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Figure  3.11.  Rotational  Response  of  LSAD  and  Nominal  SSAD  to  an  Initial  Velocity 


15 


LSAD.  After  moving  Ka  towards  the  lower  end  of  its  range  to  compensate  this  damping,  and  a 
little  experimentation  with  increasing  Km  a  as  well,  Figures  3.12  and  3.13  present  an  improved 
match  between  LSAD  and  SSAD. 

3.4  Conclusions 

This  Chapter  uses  several  assumptions  to  bring  the  SRMS  Position  Hold  plant  into  a 
workable  linear  form.  SSAD  model  accuracy  is  restricted  to  payloads  greater  than  5,000  lbs. 
This  is  not  a  crippling  assumption.  The  25,000  lbs  Hubble  Space  Telescope  is  used  for  design  in 
this  thesis.  Other  assumptions  are  made  in  linearizing  the  nonlinear  plant.  Several  nonlinearities 
are  ignored  whose  effects  are  small  in  the  envelope  of  design.  That  is,  they  either  have  negligible 
contribution,  or  dominate  at  high  frequencies.  Two  significant  nonlinearities.  Km  a  &iid  Ka, 
are  represented  as  parameter  uncertainties.  These  steps  are  all  necessary  to  obtain  a  workable 
plant  model  such  that  linear  control  design  techniques  can  be  applied.  The  drawback,  however, 
is  that  even  if  robust  stability  can  be  guaranteed  on  SSAD,  this  guarantee  will  not  translate 
to  the  full  nonlinear  plant.  SSAD  can  provide  a  reasonable  indication,  but  not  an  assurance  of 
robust  stability.  The  standard  three  block  form  for  design  and  robust  analysis  presented  in  this 
Chapter  will  be  exploited  in  subsequent  Chapters  to  provide  easy  ways  to  compare  controllers 
designed  by  different  techniques. 
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Figure  3.12.  TVansIation  Response  of  LSAD  and  off-nominal  SSAD  to  an  Initial  Velocity 
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Chapter  4 


Design 


and  Analysis  Approach 


The  24  state  linearized  model  of  the  SRMS  developed  in  Chapter  3  is  used  in  subsequent 
Chapters  for  regulator  design  using  three  techniques:  LQR,  J/2,  and  Noo  optimization.  Before 
any  of  these  techniques  are  pursued,  it  is  appropriate  to  consider  the  characteristics  of  the 
open-loop  plant  and  their  possible  effects  on  the  design  process.  This  Chapter  examines  several 
aspects  of  the  plant  and  anticipates  how  some  can  cause  problems  in  both  the  design  and 
analysis  process^.  Stability  and  performance  criteria  upon  which  each  regulator  design  are 
evaluated  is  introduced  as  well. 

Section  4.1  examines  the  open-loop  plant  and  suggests  advantages  of  precompensation  loop 
closure.  Model  reduction  is  applied  to  the  open-loop  plant  to  reduce  it  to  18  states,  thereby 
reducing  the  order  of  any  resulting  compensator.  The  singular  values  of  the  reduced  ordt. 
system  are  compared  to  the  full  order  system.  The  poles  and  transmission  zeros  of  the  reduced 
order  plant  are  evaluated.  Issues  such  as  controllability  and  observability  are  also  considered. 

Section  4.2  concentrates  on  analysis  of  the  open-loop  plant.  The  criteria  considered  are 
the  same  that  are  applied  to  each  compensator  design  in  order  to  provide  a  common  measure 
for  evaluation.  The  singular  values  of  the  transfer  functions  from  disturbance  to  end  effector 
position  and  attitude  are  inspected.  The  transfer  function  from  disturbance  to  control  effort  is 
scrutinized  as  well.  In  face  of  the  parametric  uncertainty  present  in  SSAD,  robust  stability  is 
considered  via  the  small  gain  theorem.  As  a  less  conservative  method,  the  structured  singular 
value,  ft,  is  also  used  in  analysis.  Due  to  the  nature  of  the  uncertainty,  /i-analysis  is  still 
'This  may  seem  like  foresight,  in  the  thesis,  though  it  is  artually  hindsight  in  the  research. 
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conservative.  The  impact  this  will  have  on  the  analysis  process  is  explored  as  well. 


4.1  Obtaining  a  Design  Plant 

The  first  inclination  when  designing  a  new  compensator  for  a  system  is  to  remove  the 
old  compensator  and  then  proceed  to  the  design  process  with  the  resulting  open-loop  plant. 
Such  an  open-loop  plant  was  presented  in  Figure  3.5.  The  poles  of  this  open-loop  plant  are 
plotted  in  Figure  4.1.  With  no  feedback  to  the  system,  as  expected,  there  are  six  poles  residing 
at  the  origin.  These  rigid  body  modes,  though  not  welcome,  would  be  accepted  except  that 
computationally  they  do  not  appear  exactly  at  the  origin.  Machine  precision  places  them  at 
some  small  e  on  either  side  of  the  ju-axis.  This  e  propagates  through  the  design  process  causing 
computational  problems  later. 

One  method  of  dispensing  with  this  dilemma  is  to  leave  the  joint  2tngle  feedback  loop  intact. 
The  current  Position  Hold  controller  as  shown  in  Figure  3.9  consists  of  only  a  gain  matrix. 
Leaving  the  current  Position  Hold  loop  in  place  and  designing  around  it  can  be  thought  of  as 
precompensation  loop  closure.  Since  there  are  no  dynamics  associated  with  the  Position  Hold 
gains,  designing  around  them  will  not  increase  the  order  of  a  resulting  compensator.  There  are 
several  advantages  to  this  precompensation  loop  closure. 

Each  joint  operating  in  the  current  Position  Hold  Mode  behaves  as  a  spring  with  damping; 
the  joint  angle  feedback  of  Position  Hold  will  pull  the  origin  poles  decisively  into  the  left 
half  plane.  This  not  only  alleviates  the  computational  problems  discussed  above,  but  it  also 
produces  a  non-singular  system  A  matrix.  Invertability  of  the  system  would  allow  for  the 
application  of  loop-shaping  techniques  in  the  LQG/LTR  methodology  such  as  those  presented 
in  References  [13]  and  [12].  Furthermore,  when  designing  an  enhancement  to  a  system  already 
in  use,  it  is  conceptually  more  palatable  to  design  the  upgrade  as  stability  or  performance 
augmentation  such  that  the  improvement  will  work  around  the  current  system  rather  than 
replacing  it.  If  necessary,  the  augmentation  to  the  software  can  be  dissabled  leaving  the  orginal 
system  intact. 

For  the  reasons  outlined  above,  the  current  Position  Hold  system  is  treated  for  ail  intents 
and  purposes  as  the  open  loop  plant.  Henceforth,  in  all  discussion  of  analysis  of  the  open-loop 
plant,  it  is  the  original  Position  Hold  system  that  is  actually  being  considered.  The  poles  of 
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this  system  are  shown  in  Figure  4.2. 

The  six  distant  poles,  which  correspond  to  the  dynamics  of  the  motor  rates,  are  considerably 
faster  than  the  rest  of  the  system.  They  are  even  faster  than  the  fastest  possible  sample  rate 
of  80  milliseconds  in  the  actual  system.  Eliminating  the  fast  poles  will  reduce  the  model  to 
18  states  and  thereby  reduce  the  order  of  a  compensator.  As  long  as  the  bandwidth  of  a 
compensator  designed  from  the  reduced  order  model  (ROM)  is  kept  relatively  small,  there  will 
be  no  interference.  To  insure  problems  do  not  occur  with  performance  or  stability  as  a  result 
of  the  model  reduction,  the  ROM  is  used  for  controller  design  purposes  only.  The  full  order 
model  (FOM)  is  always  used  in  analysis.  This  will  isolate  stability  robustness  problems  to  the 
parameter  uncertainties  rather  than  confusing  the  issue  with  neglected  dynamics.  Section  4.1.1 
examines  a  method  for  eliminating  the  fast  dynamics  of  the  system,  while  Section  4.1.2  explores 
the  characteristics  of  the  ROM  and  compares  it  to  the  FOM. 


4.1.1  Model  Reduction 


Denote  the  transfer  function  from  joint  rate  commands  to  end  effector  position  and  attitude 
as 

G{s)  :=  (A,B,C.D) 


This  can  be  decomposed  as 

G(^»)  =  [G(s)],  +  [G(s)]^ 


(4.1) 


where 

[G{s)l  :=  [Au,By,C^,D^) 
represents  the  slow  modes  of  the  system,  and 

[G(S)],  :=  (A22,B2,C2,D2) 

represents  the  fast  modes  to  be  cut  from  the  system.  Finding  the  matrices  which  define  the 
decomposed  system  requires  the  use  of  the  algorithm  presented  in  Reference  [14]  in  the  manner 
outlined  below; 

Using  an  ordered  Schur  decomposition,  find  a  unitary  matrix,  V  such  that 


A  =  VAV'' 


All  Ai2 

0  A22 


(4.2) 
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Consider  Ai  to  be  the  i'*  eigenvalue  of  some  matrix,  A.  Using  the  ordered  Schur  form,  one 
can  find  and  ^422  such  that 

|A.(i„)|<|Ai(i22)|  (4.3) 

For  SSAD,  the  system  is  divided  into  its  6  fast  modes  and  18  slow  modes.  Next,  solving  the 
following  equation  for  X 


and  then  computing 


A-ifX  —  X Axi  "I"  A-\2  —  0 


(4.4) 


and 


VB 


(4.5) 


I  X 

[c,  C2]  =CK' 

0  / 

one  arrives  at  the  state-space  forms 


(4.6) 


A^^ 

By 

Cl 

D 

[G(s)]^  := 

Mi 

B2 

Cl 

0 

(4.7) 


(4.8) 


Notice  that  the  full  DC  term  of  the  system,  D,  is  preserved  in  the  slow  system.  Removing 
the  6  fast  poles  in  this  manner  reduces  the  system  to  18  states.  The  next  Section  will  examine 
what  information  is  lost  in  the  reduction  process  and  explore  what  effect  this  will  have  on 
design. 


4.1.2  Characteristics  of  the  ROM 

Figure  4.3  plots  the  singular  values  of  the  ROM  and  the  FOM.  No  information  is  lost  until 
well  beyond  the  bandwidth  of  the  open-loop  system.  In  the  lower  frequency  range  where  the 
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ROM:  Solid  FOM:  Dashed 


Frequency  (radians/second) 


Figure  4.3.  Singular  Values  from  Joint  Rate  Commands  to  End  Effector  State  for  the  ROM 
and  FOM 

system  has  significant  gain,  the  two  models  are  identical.  Whatever  DC  effect  the  fast  poles 
of  the  FOM  had  on  the  system  is  preserved  in  the  reduction  process.  Also  evident  from  the 
singular  value  plots  is  the  poor  scaling  of  the  system.  This  can  be  seen  from  the  distance 
between  individual  singular  values.  This  ill-scaling  is  to  be  expected  since  the  end  effector 
state  vector  is  comprised  of  translations  measured  in  inches,  and  rotations  measured  in  radians. 
Certainly  scaling  must  be  applied  before  design  so  that  a  regulator  will  not  consider  a  one  inch 
error  and  a  one  radian  error  with  equal  gravity. 

Figure  4.4  displays  the  poles  of  the  ROM.  The  18  poles  of  the  ROM  reside  in  the  same 
locations  as  the  near  poles  of  the  FOM  as  shown  in  Figure  4.2;  as  desired,  the  fast  dynamics 
have  been  eliminated. 

Figure  4.5  bears  silent  witness  to  the  fact  that  the  model  is  non-minimum  phase.  Note, 
however,  that  all  of  the  transmission  zeros  are  exceedingly  fast  relative  to  the  rest  of  the 
system. 

We  can  study  the  controllability  and  observability  properties  of  the  system  since  SSAD  in 
its  nominal  configuration  is  a  linear  time-invariant  system.  Consider  the  following  gramians; 
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A  necessary  and  sufficient  condition  for  controllability  is  that  the  controllability  gramian 

Lr  (t)  =  J*  dt  (4.9) 

must  be  full  rank.  Dually,  a  necessary  and  sufficient  condition  for  observability  is  that  the 
observability  gramian 

L„  (t)  =  jf  di  (4.10) 

must  be  full  rank.  If  i  is  allowed  to  go  to  infinity  and  A  is  stable,  then  Lr  and  L„  can  be  found 
via  the  following  Lyapunov  equations; 

ALr  +  LrA'+BB'’  =  0  (4.11) 

A''L„  +  L„A  +  C'C  =  0  (4.12) 

Evaluating  these  gramians  for  the  ROM  ascertains  the  system  to  be  fully  controllable  and 
observable.  Recall  that  the  end  effector  position  and  attitude  are  the  measurements  used  for 
the  new  regulator  designs.  The  controls  are  the  six  joint  rate  commands.  Table  4.1  presents  the 
singular  values  for  the  controllability  and  observability  gramians.  Though  the  system  is  fully 
controllable  and  observable  to  machine  precision,  this  can  be  slightly  misleading.  Looking  at 
Table  4.1  some  modes  are  clearly  more  controllable  and/or  more  observable  then  others.  This 
implies  that  certain  modes  may  require  exorbitant  amounts  of  control  effort  if  they  are  to  meet 
certain  performance  requirements.  Fortunately,  however,  all  of  the  modes  are  stable. 

All  of  these  issues  are  important  to  keep  in  mind  during  the  design  process.  There  will  be 
limits  to  how  far  certain  poorly  controllable  poles  can  be  placed  because  of  restrictions  on  the 
control  effort.  Keep  in  mind  that  the  reduced  order  model  presented  in  this  Section  will  only 
be  used  for  design  purposes,  the  full  order  model  will  be  used  for  analysis. 


4.2  Analysis  Techniques  and  the  Open-Loop  Plant 

Before  proceeding  to  design  a  new  regulator  to  augment  the  performance  of  the  current 
Position  Hold  controller,  let  us  consider  Position  Hold  performance  as  it  stands  now.  This 
Section  will  analyze  the  current  system  following  a  threefold  path.  First,  Section  4.2.1  considers 
the  transfer  function  from  disturbance  to  end  effector  position  and  attitude  respectively  to 
determine  system  performance.  Second,  Section  4.2.2  examines  the  transfer  function  from 
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Table  4.1.  Gramian  Singular  Values 


Controllability  Gramian 

Singular  Values 

Observability  Gramian 

Singular  Values 

2.0767e+07 

2.1420e+05 

7.0207e+05 

1.4658e-01 

2.0295e+07 

1.3638e+05 

3.6847e+05 

7.7801e-02 

7.1140e+06 

8.5671e+04 

1.0944e+04 

3.8524e-02 

1.8852e+06 

6.6693e+04 

4.3336e+00 

3.4467e-02 

1.8332e+06 

4.7181e+04 

1.7348e+00 

1.6510e-02 

1.4352e+06 

2.6289e+04 

1.2482e+00 

4.7992e-03 

9.2866e+05 

2.7779e+00 

4.8469e-01 

9.0874e-04 

3.5116e+05 

1.1020e+00 

2.6327e-01 

2.4196e-04 

2.6732e+05 

8.8740e-01 

1.7180e-01 

1.5619e-05 
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disturbance  to  control  effort  in  order  to  establish  how  much  control  effort  is  required.  Finally, 
Section  4.2.3  looks  at  the  stability  robustness  of  the  system.  This  method  of  system  analysis  is 
the  same  procedure  applied  to  every  new  controller  designed  in  subsequent  Chapters. 


4.2.1  Performance 

In  evaluating  the  effect  of  a  disturbance  on  the  end  effector  state,  only  the  rotational  Shuttle 
commands  are  considered  since  the  effect  of  fictitious  translational  jets  were  added  only  so  that 
mathematically  every  state  would  have  process  noise.  In  fact,  for  Shuttle  RMS  operations, 
translational  commands  are  prohibited.  As  pointed  out  in  Section  3.2.2,  the  disturbance  is 
normalized  to  lie  between  -1  and  1;  let  us  consider  the  singular  values  from  disturbance  to  the 
end  effector  in  two  sets.  The  first  set  will  include  the  end  effector  position  in  inches  and  the 
second  set  will  consist  of  the  end  effector  attitude  in  radians.  In  this  manner  the  singular  values 
will  retain  their  units  making  the  evaluation  of  performance  an  easier  task.  Figure  4.6  displays 
these  two  sets  of  singular  values  for  the  current  Position  Hold  regulator. 

From  Figure  4.6  we  can  see  that  the  end  effector  excursion  from  equilibrium  can  be  as  great 
as  5  inches  in  position  and  as  much  as  0.03  radians  in  attitude  for  certain  Orbiter  rotation 
command  directions.  The  goal  in  designing  a  new  regulator  is  to  bring  these  maximum  excur¬ 
sions  down  as  far  as  possible  without  exceeding  the  limitations  on  the  control  effort  described 
in  the  next  Section.  Ideally  we  aim  for  less  than  1  inch  excursion  in  position  and  less  than  0.01 
radians  of  rotation  excursion. 

4.2.2  Control  Effort 

There  are  already  in  place  certain  joint  rate  command  limits,  ie.,  limitations  on  the  control 
effort.  These  limits  are  not  based  on  any  hardware  constraints,  rather  they  are  safety  settings 
to  insure  a  payload  at  the  end  effector  isn’t  commanded  too  rapidly  bringing  the  system  into 
precarious  areas  of  operation.  These  settings  are  a  function  of  the  payload  inertia.  The  limits 
are  calculated  to  maintain  a  prescribed  stopping  distance  when  operating  the  RMS.  If  new 
controllers  can  exhibit  improved  capability  to  control  the  end  effector,  there  may  be  justification 
for  increasing  the  joint  rate  limits. 

For  design  purposes,  the  joint  rate  limits  will  be  treated  as  hard  constraints  that  must 


59 


magnitude  magnitude 


Figure  4.6.  Performance  Singular  Values  of  the  Current  Position  Hold  System 
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Table  4.2.  Joint  Rate  Limits 


Joint 

Joint  Rate  Limit  (rad/sec) 

Shoulder  Yaw 

0.0052 

Shoulder  Pitch 

0.0052 

Elbow  Pitch 

0.0072 

Wrist  Pitch 

0.0107 

Wrist  Yaw 

0.0107 

Wrist  Roll 

0.0107 

be  met.  In  this  manner,  the  lower  bound  on  performance  will  be  established.  Increasing  the 
joint  rate  limits  will  allow  a  new  controller  to  use  more  control  and  thus  allow  performance  to 
improve.  Table  4.2  presents  the  current  joint  rate  limits. 

To  determine  the  control  effort  used  by  the  system  for  a  disturbance  caused  by  Shuttle 
attitude  thruster  firings,  consider  the  singular  values  of  the  transfer  function  from  the  attitude 
firings  to  each  individual  joint  rate  command.  There  are  six  joints  and  six  different  joint  rate 
limits.  Let  us  scale  each  joint  rate  command  by  its  corresponding  joint  rate  limit.  In  this 
fashion,  as  long  as  the  singular  values  of  these  six  transfer  functions  remain  below  unity,  the 
joint  rate  limits  have  not  been  exceeded.  As  Figure  4.7  shows,  the  current  system  is  safely 
below  the  joint  rate  limits. 

Keep  in  mind  that  all  performance  and  control  effort  analysis  performed  so  far  has  been 
on  the  nominal  plant.  Recall  there  are  two  parameters.  Km  A  &nd  Kff,  that  include  additive 
uncertainty.  The  next  issue  to  consider  is  robust  stability  in  face  of  this  uncertainty. 

4.2.3  Stability  Robustness 

Recall  the  standard  form  for  a  compensated  plant  with  uncertainty  shown  in  Figure  3.8.  The 
closed-loop  version  of  this  is  shown  in  Figure  4.8.  G(s)  is  the  closed-loop  transfer  function  which 
combines  the  dynamics  of  the  plant  and  the  compensator  and  A  is  the  modeling  uncertainty. 

The  closed-loop  transfer  function,  G(s),  can  be  divided  into  a  set  of  four  transfer  functions 
as  follows. 
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Figure  4.9.  Closed-Loop  System  for  Robust  Stability  Analysis 


€  _  Gn(s)  Gi2{s) 
e  G2i(s)  G22(«) 

In  this  representation,  the  nominal  closed-loop  system  is  given  by  G22(*)-  bet  us  redraw 
Figure  4.8  as  shown  in  Figure  4.9.  In  this  representation,  looking  at  the  A  and  Gii(s)  transfer 
functions,  the  small  gain  theorem  can  be  invoked. 

By  invoking  the  small  gain  theorem,  a  sufficient  condition  for  closed-loop  stability  is 

||G„(s)A|U  <  1  (4.14) 


(4.13) 


The  Hoc  norm  is  defined  in  the  frequency  domain  for  a  stable  transfer  function  matrix  as 


||G(fi)||oc  =  su^pd-[G(ju;)] 

-  In  Mi 

—  sup  ..  11 

||w||2 


(4.15) 


Therefore,  the  /foe  norm  is  the  induced  2-norm. 

In  Section  3.2.2,  A  was  defined  to  be  normalized  to  less  than  one.  Therefore,  Equation  4.14 
can  be  reduced  to 

IIGiUOIIoc  <  1  (4.16) 

This  small  gain  test  is  valid  for  any  complex  A  with  norm  less  than  one.  However,  for  indepen- 
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dent  modeling  uncertainties,  A  is  known  to  have  a  block  diagonal  structure 


A  = 


A,  0  ...  0 

0  Aj 

••  0 

0  ...  0  A„ 


For  the  SSAD  plant,  there  are  two  block  diagonal  elements  for  and  K().  Since  the  small 

gain  test  must  hold  for  all  normalized  A,  not  just  the  more  restricted  set  of  normalized  block 
diagonal  A  matrices.  Ad,  it  is  a  conservative  test.  Because  it  is  known  the  perturbations  of 
SSAD  are  real  parameter  variations,  testing  for  complex  perturbations  makes  the  small  gain 
test  even  more  conservative. 

In  an  effort  to  supply  a  less  conservative  robust  stability  test,  Doyle  [15]  developed  the 
structured  singular  value,  p.  p  is  defined  as 


p(G(jw)]  =  ^ 


0  if  det[7  —  G(jw)A(jw)]  ^  0  for  all  A  €  Ad 

{min^eAn  det[/  —  G(jw)A(jw)]  =  0  otherwise 


(4.17) 

Consider  From  the  definition  of  p,  if  p[Gn (ju>)]  >  1  at  some  frequency  w  then 

there  exists  a  block  diagonal  A(jw)  such  that 


d[Ai(ju;)]  <  1  and  det[7  -  Gii(jw)A(ja;)]  =  0 

Conversely,  if  <  1  for  all  u,  then  the  smallest  block  diagonal  A  matrix  that  makes 

the  system  singular  has  the  gain 

ff[Ai(ju;)]  >  1 

which  violates  the  norm  of  the  modeling  error.  Therefore,  a  necessary  and  sufficient  condition 
for  robust  stability  when  A  6  Ad  is 


p[Gii(j<.;)]  <  1  Vw  (4.18) 

Unfortunately,  in  the  actual  application  of  p-analysis  we  are  reduced  to  conditions  which 
are  sufficient  but  not  necessary  for  two  reasons.  First  of  all,  p  is  not  a  norm  and  it  cannot  be 
computed  easily.  Software  is  available  that  can  calculate  upper  bounds  for  p.  However,  using 
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an  upper  bound  adds  some  conservatism  to  the  analysis.  The  second  source  of  conservatism 
again  comes  from  the  fact  that  SSAD  has  real  parametric  uncertainty.  The  set  A|)  still  allows 
for  complex  perturbations  as  in  the  small  gain  test.  This  is  a  particularly  bothersome  problem 
for  the  SSAD  plant  which  contains  lightly  damped  poles  with  large  error.  With  lightly  damped 
poles,  even  small  complex  perturbations  can  cause  the  condition  for  robust  stability  to  be 
violated,  ie.,  >  1.  Consider  the  following  simple  example. 


Example  4.1:  Spring-Mass-Damper  System  with  Uncertain  Spring  Constant 

A  spring-mass-damper  system  with  a  unit  mass  can  be  represented  by  the  differential 
equation 

X  +  cz  +  kx  =  f  (4.19) 

where  c  is  the  viscous  damping  coefficient  given  as  0.1,  k  is  the  spring  constant,  and  /  is  a 
disturbance  force.  The  spring  constant  is  uncertain,  but  modeled  to  fall  in  the  range 


0.5  <  ib  <  2.5 


If  k  is  selected  in  the  center  of  its  range  for  the  nominal  system,  a  state-space  representation  is 
given  by 

^2 

X  =  Ax  -1-  Bf 


0 

1 

0 

-1.5 

-0.1 

X2 

+ 

1 

(4.20) 


The  nominal  system  has  stable  roots  at  A  =  —0.05  i:  1.224i’.  In  fact,  throughout  the  range  of  k 
the  system  remains  stable  with  roots  at  A  =  —0.05  i:  0.705<  for  the  high  end  of  the  range  and 
roots  at  A  =  — 0.05±1.5803i  for  the  low  end  of  the  range.  The  system  is  robustly  stable  in  face  of 
the  parametric  uncertainty.  Nevertheless,  since  p-analysis  will  consider  complex  perturbations, 
this  system  will  fail  the  p  test  for  stability. 

Consider,  as  an  example,  the  perturbation  matrix 


AA  = 


0  0 
i  0 
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Figure  4.10.  Robust  Stability  Analysis  of  the  Current  Position  Hold  System 

whose  norm  is  one.  Though  this  perturbation  cannot  occur  in  the  real  system,  /i-analysis 
will  test  for  such  a  A.  This  perturbation  places  the  roots  at  Ai  =  0.3394  j-  1.284t  and  A2  = 
—0.4394  —  1.284t  failing  robust  stability. 

Though  ^-analysis  does  not  supply  a  necessary  condition  for  robust  stability,  it  does  reduce 
conservatism  over  the  small  gain  test.  To  illustrate  this.  Figure  4.10  displays  the  results  of 
the  small  gain  test  and  p-analysis  performed  on  the  current  Position  Hold  controller.  Though 
/i-analysis  is  less  conservative  than  the  small  gain  test,  the  plant  still  fails  robust  stability. 

To  determine  if  the  instability  displayed  in  Figure  4.10  is  the  result  of  a  fictitious  complex 
perturbation,  or  a  configuration  the  system  can  actually  realize,  it  is  possible  to  evaluate  the  pole 
locations  of  the  system  for  different  parameter  values.  To  analyze  every  possible  combination  of 
the  twelve  parameters  would  not  be  feasible;  however,  it  is  reasonable  to  check  the  endpoints. 

Consider  when  the  end  effector  is  very  close  to  its  equilibrium  position  and  attitude.  At 
these  very  small  amplitudes,  the  current  limiter  has  not  saturated,  hence  all  six  of  the  Km  A  "c 
at  their  maximum  values.  Furthermore,  at  very  small  amplitudes  the  joint  angles  and  motor 
shaft  angles  are  almost  aligned,  thus  the  six  Ka  are  at  their  minimum  values.  Figure  4.11 
displays  the  pole  locations  of  the  system  with  the  parameters  Km  a  ^^I  Ka  at  these  endpoints 
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Real 

Figure  4.11,  Open-Loop  System  Near  Pole  Locations  With  Ka  ~  0.01  and  Km  A  =  ^ma^„ 
of  their  ranges. 

As  Figure  4.11  shows,  the  instability  indicated  by  /i-analysis  is  not  fictitious.  The  system 
is  not  robustly  stable  for  the  full  range  of  values  the  parameters  can  inhabit.  If  all  six  K(j  are 
increased  in  unison  at  0.01  increments,  the  lowest  values  that  achieve  stability  are  0.05.  This 
value  brings  the  system  poles  just  to  the  left  of  the  jh;-axis.  This  configuration  is  shown  in 
Figure  4.12.  Interpreting  these  observations  indicates  the  SSAD  plant  is  a  system  that  is  stable 
at  moderate  to  high  torque  values  when  Ko  is  higher  than  0.05  and  unstable  at  very  low  torque 
values  when  K(}  is  below  0.05.  Such  a  system  would  stably  damp  and  then  limit-cycle  close  to 
the  equilibrium  position;  such  behavior  is  observed  in  the  actual  system. 

To  totally  eliminate  this  limit  cycle  behavior  with  a  regulator  may  require  a  large  sacrifice 
of  performance.  In  any  case,  it  would  be  misleading  to  think  that  eliminating  such  behavior 
with  a  regulator  on  our  model  would  do  so  for  the  actual  system.  Recall  that  in  Chapter  3  fric¬ 
tional  effects  were  neglected  in  SSAD.  Such  effects  would  certainly  contribute  to  the  limit  cycle 
behavior.  This  is  something  to  keep  in  mind  when  designing  new  compensators.  Instabilities 
caused  by  very  low  K{}  may  not  be  a  problem  since  they  indicate  slow  limit-cycling  of  small 
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Figure  4.12.  Open-Loop  System  Near  Pole  Locations  With  Ka  =  0.05  and  Kma  = 
amplitude  near  the  equilibrium  values  which  is  unavoidable. 

4.3  Summary 

In  this  Chapter,  the  24  state  SSAD  model  has  been  reduced  to  18  states  for  design  purposes 
only.  The  24  state  model  will  be  used  for  analysis  purposes.  Three  criteria  that  will  be 
considered  for  each  design  are  the  performance  singular  values,  the  control  effort  singular  values, 
and  robust  stability  of  the  plant.  The  analysis  process  was  carried  out  on  the  current  Position 
Hold  controller  as  both  an  example  of  the  process  and  the  baseline  for  comparison.  In  analyzing 
performance,  two  sets  of  singular  values  were  examined:  disturbance  to  end  effector  position  as 
well  as  end  effector  attitude.  Control  effort  was  determined  by  looking  at  the  singular  values  of 
the  transfer  function  from  disturbance  to  control  effort.  Finally,  robust  stability  was  determined 
via  the  small  gain  test,  and  the  less  conservative  structured  singular  value,  (i.  /i-analysis  may 
still  be  too  conservative  a  test  since  the  SSAD  plant  is  restricted  to  real  parametric  variations. 
To  determine  this,  we  resorted  to  eigenvalue  evaluation  at  certain  parameter  values. 
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Chapter  5 


LQR  Design 


Linear  Quadratic  Regulator  (LQR)  design  is  very  limited  in  its  applications.  This  technique 
not  only  requires  knowledge  of  every  state,  but  also,  this  knowledge  must  be  uncorrupted  by  any 
sensor  noise.  Since  measurements  of  every  state  are  not  available,  a  linear  quadratic  regulator 
cannot  be  implemented  on  the  actual  SRMS  without  first  designing  an  estimator  to  approximate 
the  states.  Nevertheless,  it  is  useful  to  design  a  LQR  because  it  gives  an  idea  of  the  best  we 
can  hope  to  achieve  in  terms  of  minimizing  the  ‘2-norm’  of  the  error,  HcHj.  It  is  a  simple 
first  step  that  helps  to  motivate  the  model-based  compensator  (MBC)  designs.  In  MBC  design 
techniques,  the  states  of  the  system  are  estimated  from  the  measurements.  These  estimates  are 
then  used  by  a  compensator.  MBC  design  techniques  are  the  topics  of  subsequent  Chapters. 

The  LQR  technique  is  derived  from  optimal  control  theory.  Kwakernaak  and  Sivan  [16] 
provide  the  full  details  of  this  theory.  Section  5.1  provides  the  main  results  of  the  LQR  technique 
as  it  applies  to  the  SRMS  problem. 

Section  5.2  analyzes  the  LQR  for  the  SRMS  using  the  techniques  presented  in  Chapter  4. 
The  performance  and  control  effort  singular  values  are  examined  and  compared  to  the  open- 
loop  system.  A  time  domain  simulation  of  a  typical  jet  firing  pattern  is  also  presented.  Finally, 
stability  robustness  is  examined  in  face  of  the  parametric  uncertainty. 

5.1  LQR  Theory 

LQR  theory  provides  no  direct  method  to  design  for  stability  robustness  or  robust  perfor¬ 
mance,  thus  for  design  purposes  Figure  51  eliminates  the  A  block  of  Figure  3-8.  The  effects  of 
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Figure  5.1.  Plant  With  Compensator 

the  A  block  are  considered  after  the  design  process  is  completed. 
The  general  state-space  representation  of  Figure  5.1  is 


Ap 

Bi 

Bi  ' 

^p 

e 

= 

c, 

£>11 

£>12 

d 

(5.1) 

y 

C2 

£>21 

£>22  _ 

u 

In  the  LQR  design  methodology,  uncorrupted  full  state  feedback  is  called  for.  Thus  the 
disturbance  vector,  d,  does  not  include  sensor  noise,  only  the  process  noise.  Furthermore,  C2 
is  the  identity  matrix.  We  will  design  the  LQR  compensator  from  the  reduced  order  model 
presented  in  the  last  Chapter.  For  analysis,  we  will  apply  the  reduced  order  compensator 
design  on  the  full  order  plant.  The  result  of  an  LQR  design  is  well  known  to  be  a  constant  gain 
matrix,  thus  an  LQR  does  not  increase  the  order  of  the  closed-loop  system. 

The  error  vector,  e,  includes  the  states  and  the  controls. 

c  = 

u 

The  states  are  weighted  by  a  symmetric  and  positive  semidehnite  matrix,  Q,  while  the  controls 
are  weighted  by  a  symmetric  and  positive  definite  matrix,  R.  Additionally,  the  modelling 
performed  in  Chapter  3  produced  no  direct  feedthrough  term,  thus  D22  =  0.  Hence  for  the 
LQR  formulation  of  the  SRMS,  Equation  5.1  can  be  recast  as 
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The  LQR  problem  is  to  devise  a  feedback  control  law  which  minimizes  the  quadratic  cost 
functional 

J  =  y  Qxp-\- u'  Rt^di  (5.3) 

The  optimal  feedback  control  law  is 

u  =  -Gxp 

G  =  R-'Bp'K  (5.4) 

where  K  is  the  symmetric,  positive  semidefinite  solution  of  the  algebraic  Riccati  equation 

Ap'  K  +  KAp  -  KBpR-' Bp'K  +  Q  =  0  (5.5) 

In  general,  there  are  many  solutions  to  Equation  5.5,  but  only  one  of  them  is  positive 
semidefinite.  The  gain  matrix  K  will  exist  and  be  positive  definite  provided  that  the  system 
is  stabilizable  and  detectable.  In  other  words,  any  uncontrollable  or  unobservable 
modes  are  asymptotically  stable. 

The  closed-loop  system  dynamics  become 

Xp  =  ApXpA  Bpu-ir  B^d 
u  =  -Gxp 

Xp  =  [y4p  -  BpG]  Xp  +  B^d  (5.6) 

Furthermore,  the  closed-loop  system  is  guaranteed  to  be  stable,  ie., 

ReA<  [Ap  -  BpG]  <  0  V»  (5.7) 

The  exact  pole  locations  will  depend  not  only  upon  the  open-loop  system  (through  Ap  and  Bp), 
but  also  upon  how  the  weighting  matrices  Q  and  R  are  selected.  Thus  Q  and  R  are  the  design 
parameters  that  are  tuned  until  a  LQR  design  meets  desired  disturbance  rejection  and  control 
effort  requirements. 
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5.2  LQR  Results 


For  the  SRMS  problem,  the  only  states  of  concern  are  the  ones  corresponding  to  the  end 
effector  position  and  attitude.  To  express  this  desire,  Q  is  selected  as  a  matrix  of  zeros  except  for 
the  values  along  the  diagonal  elements  corresponding  to  the  end  effector  position  and  attitude 
states. 

Recall  that  the  end  effector  position  is  measured  in  inches  and  the  attitude  is  measured  in 
radians.  To  scale  the  outputs,  consider  the  singular  values  of  the  open-loop  system  as  shown  in 
Figure  4.6.  The  attitude  maximum  singular  value  is  more  than  an  order  of  magnitude  below  the 
position  maximum  singular  value.  The  weights  in  Q  are  chosen  to  reflect  this  difference.  The 
elements  of  Q  corresponding  to  attitude  are  placed  more  than  an  order  of  magnitude  above  the 
position  weights.  In  this  manner  a  one  inch  error  is  not  considered  as  serious  as  a  one  radian 
error.  The  actual  values  of  the  weights  do  not  matter,  only  the  ratio  between  them.  Such  a 
selection  of  Q  also  insures  that  the  system  (Ap,  Bp,Qi)  is  stabilizable  and  detectable. 

In  selecting  R,  certainly  every  control  must  be  weighted  to  insure  the  joint  rate  limits  are 
not  violated.  As  shown  in  Table  4.2,  we  already  know  the  limit  of  control  demand  considered 
acceptable.  As  a  starting  point,  choose  the  inverse  of  the  joint  rate  limits  as  the  initial  weight 
and  then  iterate  over  the  design  process,  tuning  R  each  time  until  the  singular  values  from 
disturbance  to  control  effort  fall  just  below  the  joint  rate  limits  for  each  control  channel.  This 
procedure  will  result  in  the  best  performance  possible  with  the  LQR  design  technique  given  the 
limits  on  the  control  effort. 

Section  5.2.1  examines  the  control  effort  and  performance  singular  values  in  the  same  manner 
as  outlined  last  Chapter.  A  time  domain  simulation  is  included  as  well  to  help  interpret  the 
singular  values.  Section  5.2.2  examines  the  stability  robustness  of  the  LQR  design. 

5.2.1  LQR  Control  Effort  and  Performance 

Figure  5.2  shows  the  singular  values  from  disturbance  to  control  effort  after  the  iteration 
process.  Each  control  weight  is  fine  tuned  until  the  scaled  singular  value  falls  just  below 
one.  Below  0.1  radians/second,  the  control  effort  has  not  changed  much  from  the  open-loop 
system.  For  disturbances  occuring  at  1  radian/second,  the  LQR  system  uses  almost  two  orders 
of  magnitude  more  control  than  the  open-loop  system.  Nevertheless,  the  LQR  system  does  not 
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Figure  5.2.  Control  Effort  Singular  Values  of  the  LQR  System 
exceed  the  joint  rate  limits. 

Figure  5.3  presents  the  singular  values  from  disturbance  to  end  effector  position  and  attitude. 
Included  on  the  same  plots  is  the  corresponding  information  for  the  open-loop  system.  For  the 
LQR  design  we  were  able  to  meet  the  performance  goal  of  less  than  1  inch  excursion  in  position. 
This  is  expressed  by  the  maximum  singular  value  falling  below  one  for  all  frequencies.  Thus 
for  any  input  disturbance,  its  effect  on  the  end  effector  position  will  be  multiplied  by  a  gain  at 
least  less  than  one.  Since  the  input  disturbance  is  already  normalized  to  fall  between  -1  and 
1,  the  maximum  excursion  of  the  end  effector  position  is  less  than  one  inch.  Similarly,  since 
the  maximum  singular  value  for  the  disturbance  to  attitude  transfer  function  falls  below  10~^ 
we  have  guaranteed  less  than  0.01  radians  of  rotation  excursion.  Both  goals  are  met  without 
exceeding  the  joint  rate  limits. 

To  better  visualize  what  is  occuring  on  the  singular  value  plots,  a  time  domain  simulation 
is  presented  in  Figures  5.4  and  5.5.  These  figures  include  both  the  response  of  the  open-loop 
system  and  the  LQR  system.  In  this  simulation  a  roll  command  is  given  for  a  full  10  seconds 
immediately  followed  by  a  10  second  pitch  command  and  finally  a  10  second  yaw  command. 

From  Figure  5.4  we  can  see  that  the  X  direction  has  considerably  poorer  disturbance  re- 
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Magnitude  (PH;  dashed  LQR:  solid)  Magnitude  (PH:  dashed  LQR:  solid) 


Figure  5.3.  Performance  Singular  Values  of  the  LQR  System 


lacbet  (PH:  dashed  L(2R:  solid)  Inches  (PH:  dashed  LQR:  solid)  Inches  (PH:  dashed  IX^  solid) 


End  Effector  X  Positioa 


Time  (seconds) 


Figure  5.4.  Translational  Response  of  PH  System  and  LQR  System  to  Attitude  Thruster 


Commands 
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End  Effector  Altiiude:  lUdl 


Figure  5.5.  Rotational  Response  of  PH  System  and  LQR  System  to  Attitude  Thruster  Com¬ 
mand 


Frequency  (ladians/second) 

Figure  5.6.  Robust  Stability  Analysis  of  the  LQR  System 

jection  than  the  Y  and  Z  directions.  Nevertheless,  significant  improvement  over  the  uncom¬ 
pensated  system  is  also  apparent.  Similarly,  roll  disturbance  rejection  is  considerably  better 
than  the  other  axes  as  shown  in  Figure  5-5.  These  plots  indicate  that  while  in  HST  release 
configuration,  the  SRMS  can  reject  disturbances  much  easier  in  some  directions  over  others. 

5.2.2  LQR  Stability  Robustness 

Applying  both  the  small  gain  test  and  /r-analysis  to  the  closed-loop  LQR  system  results  in 
Figure  5.6.  Unfortunately  the  system  fails  both  tests.  Recall  as  outlined  last  Chapter  that  both 
tests  are  conservative  for  SSAD  since  the  uncertainties  are  real  parameter  variations.  Therefore 
it  is  possible  that  the  system  is  in  fact  robustly  stable  in  face  of  the  uncertainty  though  it 
cannot  be  shown. 

Last  Chapter,  to  confirm  the  results  of  /i-analysis,  we  evaluated  the  eigenvalues  of  the 
open-loop  system  for  different  parameter  values.  This  technique  conclusively  showed  that  the 
open-loop  system  was  in  fact  not  robustly  stable.  As  before,  we  can  evaluate  the  eigenvalues  of 
some  combinations  of  the  parameters  to  further  our  information  on  robust  stability. 
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Figure  5.7.  LQR  Closed-Loop  System  Near  Pole  Locations  for  the  Nominal  and  Off-Nominal 
System 

Consider  the  twelve  parameters  in  two  sets,  the  Km  a  and  the  Ka.  After  checking  the 
four  combinations  of  these  two  sets  at  the  endpoints  of  their  ranges,  as  we  discovered  with  the 
open-loop  plant,  the  worst  possible  combination  is  found  to  be  the  Km  a  at  the  top  of  their 
range  and  the  Ka  at  the  bottom  of  their  range.  The  worst  combination  is  defined  to  be  the 
system  in  which  the  closed-loop  poles  reside  closest  to  the  jcv-axis,  or  furthest  into  the  right 
half  plane.  Figure  5-7  presents  the  LQR  closed-loop  near  poles  of  the  nominal  plant  and  the 
worst  off-nominal  plant. 

For  these  combinations,  the  system  remains  stable.  Let  us  qualify  this  immediately;  this 
does  not  prove  robust  stability.  Though  experimentation  seems  to  indicate  that  the  combination 
of  the  Km  A  at  the  top  of  their  range  and  the  Ka  at  the  bottom  of  their  range  is  the  system’s 
worst,  we  cannot  prove  this  conclusively.  This  is  a  multi- variable  system  with  twelve  varying 
parameters.  There  can  be  some  other  combination  of  pa^^uneters  not  considered  that  will  result 
in  an  unstable  closed-loop  system. 

To  further  dampen  our  spirits,  keep  in  mind  that  several  nonlinearities  have  been  totally 
eliminated  in  the  development  of  SSAD.  Thus  even  if  robust  stability  is  guaranteed  on  the 


model,  it  is  not  guaranteed  on  the  actual  system.  We  will  not  bypass  the  extensive  pre-flight 
simulation  requirement  of  a  Shuttle  mission. 


5.3  Summary 

Using  the  LQR  technique,  we  have  designed  a  compensator  that  is  capable  of  meeting  the 
performance  objectives  within  the  allowable  control  effort.  The  resulting  regulator  fails  robust 
stability  analysis  via  both  the  small  gain  test  and  /i-analysis.  However,  actual  eigenvalue 
evaluation  with  four  combinations  of  the  parameters  at  their  endpoints  seems  to  indicate  that 
the  system  may  be  robustly  stable.  Unfortunately,  this  cannot  be  proven  unequivocally. 

There  are  several  drawbacks  to  the  LQR  technique.  This  technique  not  only  requires  knowl¬ 
edge  of  every  state,  but  also,  this  knowledge  must  be  uncorrupted  by  sensor  noise.  These 
requirements  render  an  LQR  design  impractical  for  implementation  of  the  actual  SRMS.  An 
estimator  would  be  required  to  determine  the  states  for  the  LQR.  This  would  add  noise  to  the 
state  feedback  vector  which  the  LQR  does  not  account  for.  If  an  estimator  must  be  designed, 
it  is  better  to  use  an  optimal  design  method  such  as  the  LQG  technique. 
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Chapter  6 


H2  Design 


Last  Chapter  we  saw  that  with  full  state  feedback,  performance  goals  were  met  without 
exceeding  the  control  effort  limitations.  However,  in  the  actual  system  we  are  limited  to  an 
observation  vector,  y,  which  is  some  linear  combination  of  the  states  corrupted  by  noise.  The 
solution  to  this  more  involved  problem  is  given  by  the  Linear  Quadratic  Gaussian  (LQG)  theory 
studied  extensively  in  the  1960’s  and  1970’s.  The  LQG  solution  is  realized  via  the  separation 
principle.  The  problem  is  reduced  to  two  sub-problems,  the  solutions  to  which  are  well  known. 

The  first  sub-problem  is  to  obtain  an  estimate,  x,  of  the  state,  x.  This  estimate  is  to  be 
optimal  in  the  sense  that  E  |(x  —  x)^  (x  —  x)|  is  minimized.  The  solution  to  this  problem  is 
given  by  Kalman  filter  theory.  Kalman  filters  have  the  structure  of  a  state  observer.  They 
receive  as  inputs  the  plant  input,  u,  and  plant  output,  y,  and  produce  as  an  output  the  state 
estimate  vector,  x. 

The  second  sub-problem  involves  using  the  state  estimate  from  the  Kalman  filter  as  if  it 
were  an  exact  measurement  of  the  states.  As  in  last  chapt  er,  we  solve  the  resulting  determin¬ 
istic  linear  quadratic  control  problem.  The  state  estimates  are  fed  through  the  resulting  state 
feedback  matrix,  thus  producing  the  control  for  the  complete  system.  The  structure  of  the  total 
LQG  compensator  is  a  Kalman  filter  in  series  with  a  state  feedback  matrix  [12]. 

A  consistent,  but  more  suave  term  for  the  LQG  problem  is  the  H2  problem.  This  nomen¬ 
clature  arises  because  solving  the  LQG  problem  is  analogous  to  the  minirruzation  of  the  H2 
norm  of  the  transfer  function  from  exogenous  inputs  to  the  error  vector.  Section  6.1  develops 
the  H2  theory  in  this  fashion,  following  the  work  of  Reference  [17].  This  approach  is  used 
over  the  traditional  presentation  of  the  LQG  problem  because  it  utilizes  the  two  block  format 
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Figure  6.1.  Plant  With  Compensator 

we  used  in  previous  Chapters.  Aside  from  visualizing  the  problem  in  a  simpler  manner,  this 
approach  helps  motivate  the  Hgo  problem  in  the  culminating  Chapter.  The  Hgo'  problem  can 
be  developed  parallel  to  this  approach. 

Section  6.2  analyses  the  results  of  the  design  using  the  same  criteria  as  before.  The 
singular  values  from  disturbance  to  control  effort  and  disturbance  to  the  end  effector  states  are 
examined  and  compared  to  the  open-loop  system.  A  time  domain  simulation  of  the  same  jet 
firing  pattern  used  before  is  also  presented.  Finally,  stability  robustness  of  the  closed-loop  Hi 
system  is  examined  including  the  parametric  uncertainty. 

6.1  H2  Theory 

Recall  the  general  block  diagram  of  the  plant,  but  with  the  A  block  removed,  as  shown 
in  Figure  6.1.  This  is  the  plant  with  compensator  where  u  and  y  are  the  control  inputs  and 
observation  vector  respectively,  d  represents  all  exogenous  inputs  to  the  system,  d  includes 
process  and  sensor  noise  for  the  Hi  problem,  e  is  the  vector  of  signals  whose  mean  squared 
value  we  wish  to  minimize.  As  before  we  chose  it  to  contain  the  control  effort  and  end  effector 
states. 

c  = 

'  and  are  both  TTardy  aparea 
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P(fi)  is  derived  from  the  nominal  plant  model.  Suppose  that  P{s)  is  partitioned  as  follows: 


c 

■^'1  (*)pj  Xmi 

xm3 

d 

y 

P2I  (^)pjXmi 

U 

We  can  eliminate  u  and  y  using  u  =  Ky,  to  obtain 

e=[Pn  +  PnKil  -  P22K)-'  Pi,]  d  (6.2) 

It  is  convenient  to  adopt  the  shorthand  for  the  linear  fractional  transformation  of  P  and  K 

Pt(P,K)  =  Pu  +  P,2K{I  -  P22K)-' P2,  (6.3) 


so  that  we  can  write 

e  =  J^fiP,K)d  (6.4) 

A  simple  way  to  state  the  H2  problem  is  to  minimize  \\Tt  {P,  A^)||2  for  stabilizing  K  over  all 
^t  {P,K)  €  Hi-  The  Hi  norm  is  defined  in  the  frequency  domain  for  a  stable  transfer  function 
matrix,  G(s)  as 


\\G\\i  s  /_  J^^ce  \Gi-ju>)'  G{ju>)]  du>y  (6.5) 

Recall  that  the  general  state-space  representation  of  Figure  6.1  is 


Ap 

B, 

Hi  ' 

€ 

= 

c, 

Du 

Dm 

d 

(6.6) 

y 

Ci 

Di\ 

Dii 

u 

In  order  to  insure  stabilizing  controllers  exist,  the  uncompensated  plant,  (Ap,  Bi,Gi),  as 
well  as  (Ap,  B, ,  Ci)  must  be  stabilizable  and  detectable.  The  SSAD  plant  meets  this  condition 
and  we  must  choose  the  error  vector  and  its  weights  so  that  it  too  meets  this  condition.  In  order 
to  ensure  the  realizability  of  resulting  controllers  we  must  also  choose  the  weights  on  the  error 
vector  so  that  rank(Di2)  =  mj  and  rank(D2i)  =  pi  where  m2  and  pi  are  given  in  Equation  6.1. 

Define  the  following  transformation  variables 


z,  =  (d,2''D,2)"^  (6.7) 

Zi  =  ^DiiDii^^ 
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and  apply  them  to  the  plant  as  follows 


B2  — 

Cl  =  Z2C2 

D\2  =  DmZ\ 


(6.8) 


Let  us  assume  these  transformations  have  already  been  applied,  thus  D12  &nd  D21  have 
been  normalized  so  that 


D,2  =  [0  /]■''  and  D21  =  [0  /]  (6.9) 

The  point  of  these  transformations  is  that  we  can  reduce  the  complexity  of  the  controller 
formulas  as  long  as  we  can  write 


0,2'' [C,  D,2]  =  [0  /]  (6.10) 

and 

(6.11) 

Equation  6.10  means  that  the  penalty  on  c  =  Ci  Xp■\■D^^u  includes  a  normalized,  nonsingular 
penalty  on  u.  In  other  words,  the  error  vector  includes  the  control  inputs  with  an  identity 
weighting  matrix  and  there  is  no  cross  weighting  between  the  states  and  the  control. 

Equation  6.11  is  dual  to  Equation  6. 10.  This  equation  flows  from  assuming  that  the  dis¬ 
turbance  vector,  d,  includes  both  the  process  noise  on  the  states  and  the  sensor  noise  on  the 
measurements.  These  noises  are  separate  and  the  sensor  noise  weighting  matrix  is  normalized 
and  nonsingular. 

The  transformations  in  Equation  6.7  allow  us  to  choose  non-normalized  weights  for  D12  and 
D21  and  then  scale  the  plant  so  that  Equations  6.10  and  611  still  hold. 


5. 

D21 


D21 '  = 
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Two  additional  assumptions  made  on  the  plant  are  that  Dn  =  0  and  D22  =  0.  We  require 
the  former  for  the  H2  cost  to  be  finite;  the  latter  simplifies  the  formulas.  For  SSAD  D22  =  0 
and  we  chose  the  error  vector  to  not  include  the  disturbance  hence  Dn  =0. 

Thus  the  form  of  the  system  becomes 


Ap 

Si 

82 

e 

= 

c, 

0 

£>12 

d 

(6.12) 

y 

C2 

£>21 

0 

u 

with 

D,2  =  [0  /]'■  and  D21  =[0  7] 


Before  proceeding,  let  us  define  some  notation.  Let  A,  Q,  and  R  be  real  n  x  n  matrices.  Q 
and  R  are  both  symmetric.  Define  the  Hamiltonian  matrix,  H,  as 


H 


A  R 
Q  -A'l' 


(6.13) 


The  definitions  of  the  symbols  dom(Ric)  and  Ric(//)  are  that  ii  H  £  dom(Ric)  and  X  =  Ric(//) 
then 

(i)  X  is  symmetric. 

(ii)  X  satisfies  the  algebraic  Riccati  equation 

A'  X  +  XA  +  XRX  -Q  =  0 


(iii)  A  +  RX  is  stable. 

In  order  to  find  an  admissible  controller  K  which  minimizes  ||.7v  (P,  7f)||2,  we  must  solve 
the  following  two  Riccati  equations  given  by  their  Hamiltonian  matrices 


X2  =  Ric 

Ap 

-Ci'Ci 

-B2B2'' 

-Ap'’ 

(6.14) 

Y2  =  Ric 

Ap'' 

-Si  Si'’ 

-C2''C2 

-Ap 

(6.15) 
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The  two  solutions,  X2  and  Vbi  are  positive  semidehnite.  Using  these  solutions,  define 


F2  = 

-82^X2 

(6.16) 

L2  = 

-Y2C2'' 

(6.17) 

Notice  that  if  the  control  weighting  matrix  is  normalized  to  identity  as  we  have  done,  F2  is 
the  optimal  state  feedback  gain  as  solved  for  last  in  the  Chapter  with  full  state  feedback.  L2 
represents  the  solution  to  the  Kalman  filter  gain  matrix.  Now  we  may  write  the  unique  optimal 
controller  as 


K(s)  := 


Ap  -k  B2F2  -|-  L2C2 

-L2 

F2 

0 

(6.18) 


Considering  that  our  plant  is  square,  (six  inputs  and  six  outputs),  we  have  an  invertible 
system  A  matrix,  and  that  the  non-minimum  phase  transmission  zeros  are  fast  relative  to  the 
rest  of  the  system,  it  is  tempting  to  apply  a  loop-shaping  technique.  The  Linear  Quadratic 
Gaussian/Loop  Transfer  Recovery  (LQG/LTR)  method  presented  in  References  [12]  and  [13]  is 
a  loop-shaping  technique  that  allows  us  to  shape  the  singular  values  at  the  input  or  output  of 
the  plant. 

Unfortunately,  in  the  LQG/LTR  methodology,  the  ways  in  which  the  process  noise  and 
sensor  noise  enter  the  plant  are  treated  as  tuning  parameters  rather  than  as  representations 
of  the  real  plant  behavior.  In  terms  of  Equation  6.6,  B-\  and  i>2i  become  design  parameters 
which  are  adjusted  along  with  the  weights  on  the  states  and  the  controls  to  shape  the  singular 
values.  For  SSAD  the  process  noise,  Shuttle  jet  firings,  does  enter  the  system  in  a  prescribed 
manner.  LQG/LTR  can  still  be  applied  on  SSAD,  only  it  is  not  possible  to  directly  shape  the 
singular  values  of  interest.  After  initial  attempts  at  applying  LQG/LTR  it  became  apparent 
that  conventional  H2  allows  a  more  direct  approach  the  problem  since  and  D21  are  physical 
parameters. 
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6.2  H2  Results 


In  the  H2  problem,  the  design  parameters  are  the  choice  of  weights  on  the  contents  of  the 
error  vector,  y  and  u.  In  SSAD,  since  y  corresponds  to  the  end  effector  states,  we  can  choose  the 
C^  weights  as  we  selected  Q  when  designing  a  LQR.  We  multiply  each  channel  corresponding 
to  an  attitude  by  an  order  of  magnitude  to  account  for  the  difference  between  position  and 
attitude  singular  values  in  the  open-loop  system.  In  this  manner  a  one  inch  error  is  not  treated 
as  serious  as  a  one  radian  error  by  the  H2  controller.  This  choice  is  only  an  initial  guess  and 
can  be  modified  to  bring  the  closed-loop  system  within  performance  objectives. 

The  weights  on  the  control  effort,  Z?i2,  are  initially  placed  at  the  inverse  of  the  Joint  rate 
limits  as  in  the  LQR  design.  Again,  this  design  parameter  is  tuned  until  a  satisfactory  design 
is  achieved.  One  thing  to  note,  the  final  weights  of  the  H2  design  do  not  have  to  correspond 
to  the  final  weights  of  the  LQR  design.  Recall  the  differences  between  the  two  design  plants; 
LQR  assumes  full  state  feedback  uncorrupted  by  noise  while  the  H2  compensator  has  only  the 
observation  vector  which  is  also  corrupted  by  sensor  noise. 

Section  6.2.1  examines  the  control  effort  and  performance  singular  values  of  the  H2  design. 
A  time  domain  simulation  driven  by  the  same  jet  firing  pattern  as  used  on  the  LQR  is  included 
as  well  to  help  interpret  the  singular  values.  Section  6.2.2  examines  the  stability  robustness  of 
the  H2  design. 

6.2.1  H2  Control  Effort  and  Performance 

Figure  6  2  shows  the  singular  values  from  disturbance  to  each  individual  control  effort 
channel  after  tuning  the  weights  on  the  error  vector.  As  before,  each  control  weight  is  fine 
tuned  until  the  scaled  singular  value  falls  just  below  one.  Notice  that  two  of  the  control  effort 
singular  values  actually  lie  below  one.  These  are  the  singular  values  corresponding  to  elbow 
pitch  and  wrist  pitch  joints. 

During  the  design  process,  if  a  particular  control  effort  singular  value  fell  below  one,  the 
corresponding  weight  was  reduced  allowing  the  compensator  to  use  more  control  in  that  channel. 
Conversely,  if  a  control  effort  singular  value  fell  above  one,  the  corresponding  penalty  was 
increased.  For  the  elbow  pitch  and  wrist  pitch  joint  rates,  the  penalties  were  successively  reduced 
up  until  the  weighting  matrix,  started  to  numerically  fail  the  rank  tests.  Nevertheless,  the 
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Figure  6.2.  Control  Effort  Singular  Values  of  the  H2  System 

compensator  chose  not  to  use  more  control  in  these  channels. 

There  are  two  explanations  for  this  result.  First,  the  elbow  pitch  and  wrist  pitch  joint  rates 
may  have  limited  ability  to  affect  the  performance  objectives.  Consider  the  simple  example  of  a 
body  with  horizontal  and  vertical  velocity  as  its  controls.  If  we  choose  as  performance  objectives 
vertical  position  only,  a  controller  would  have  no  reason  to  use  the  horizontal  control  channel. 
This  seems  unlikely  for  SSAD  since  clearly  from  Figure  6.2  it  is  using  some  control  in  these 
channels.  If  the  penalty  is  made  less  on  these  channels,  it  should  use  more  control  even  if  the 
benefit  is  slight.  A  more  likely  explanation  is  that  use  of  the  elbow  pitch  and  wrist  pitch  joint 
rates  is  penalized  through  some  coupling  in  the  system.  Use  of  elbow  pitch  may,  for  instance, 
reduce  the  error  in  the  X  direction  but  cause  large  excursions  in  Y  which  would  require  use 
of  shoulder  pitch  to  compensate.  Thus  the  weight  on  shoulder  pitch  indirectly  penalizes  elbow 
pitch  as  well. 

Figure  6.3  presents  the  singular  values  from  disturbance  to  end  effector  position  and  attitude. 
Included  on  the  same  plots  is  the  corresponding  information  for  the  open-loop  system.  For  the 
H2  design  we  are  barely  able  to  meet  the  performance  goal  of  less  than  1  inch  excursion  in 
position.  We  are  able  to  meet  the  attitude  goal  of  less  than  0.01  radians  of  rotation  excursion 
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Singular  Values  from  Disturbance  to  End  Effector  Position 


Figure  6.3.  Performance  Singular  Values  of  the  H2  System 


without  problem.  Both  goals  are  met  without  exceeding  the  joint  rate  limits. 

As  with  the  LQR  design,  to  help  visualize  what  is  occuring  on  the  singular  value  plots,  a 
time  domain  simulation  is  presented  in  Figures  6.4  and  6.5.  These  Figures  include  both  the 
response  of  the  open-loop  system  and  the  system.  As  before,  a  roll  command  is  given  for 
a  full  10  seconds  immediately  followed  by  a  10  second  pitch  command  and  finally  a  10  second 
yaw  command. 

From  Figure  6.4  we  can  see  that  as  in  the  LQR  design,  the  X  direction  has  considerably 
poorer  disturbance  rejection  than  the  Y  and  Z  directions  for  the  H2  compensator.  Similar 
improvement  over  the  uncompensated  system  as  in  the  LQR  design  is  also  apparent.  For  the 
attitude  directions,  roll  disturbance  rejection  is  considerably  better  than  the  other  axes  as  shown 
in  Figure  6.5.  Overall,  the  resulting  design  performance  is  only  slightly  below  the  full  state 
feedback  LQR  design. 

6.2.2  H2  Stability  Robustness 

Applying  both  the  small  gain  test  and  /i-analysis  to  the  closed-loop  i/j  system  results  in 
Figure  6.6.  As  before,  the  system  fails  both  stability  robustness  tests.  With  the  LQR  plant,  we 
evaluated  the  eigenvalues  of  the  closed-loop  system  for  four  sets  of  the  parameters  and  showed 
that  the  system  was  stable  for  these  endpoints.  Though  this  is  not  conclusive  proof,  it  suggests 
that  it  is  possible  that  the  LQR  closed-loop  system  is  indeed  robustly  stable. 

Evaluating  those  same  sets  of  parameters  for  the  closed-loop  H2  design  does  show  conclu¬ 
sively  that  the  system  is  robustly  unstable.  Figure  6.7  shows  the  near  pole  locations  for  the 
nominal  closed-loop  H2  system.  The  following  four  Figures  show  the  closed-loop  poles  for  the 
four  sets  of  perturbed  parameters.  Figure  6.8  places  both  the  Km  a  &nd  the  Ka  at  their  mini¬ 
mum  values.  Figure  6.9  places  both  sets  of  parameters  at  their  maximum  values.  Figure  6.10 
places  the  Km  a  at  their  minimum  values  and  the  Ka  at  their  maximum  values.  Finally,  Fig¬ 
ure  6.11  considers  the  last  permutation  of  the  Km  A  at  their  maximum  values  and  the  K(;  at 
their  minimum  values. 

Aside  from  the  nominal  plant,  the  closed-loop  system  is  only  stable  for  Figure  6.9.  In 
Chapter  4  when  considering  the  open-loop  plant,  we  were  able  to  isolate  the  stability  problem 
to  when  the  system  is  operating  at  very  low  amplitudes.  In  this  region,  K ma  is  at  its  maximum 
and  K(j  is  near  its  minimum.  For  the  H2  system  we  cannot  draw  the  same  conclusions.  Stability 
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Figure  6.4.  TVansIational  Response  of  PH  System  and  H2  System  to  Attitude  Thruster 


Commands 


90 


Figure  6.5.  Rotational  Response  of  PH  System  and  H2  System  to  Attitude  Thruster  Com¬ 


mands 
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H2:  Small  Gain  Test  (solid)  Mu-Analysis(dashed) 


Figure  6.6.  Robust  Stability  Analysis  of  the  H2  System 


H2  Closed-Loop  Near  Poles  for  Nominal  Plant 
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Figure  6.7.  Closed-Loop  Poles  of  the  Nominal  H2  System 


Figurn  6.9.  Closed-Loop  Poles  of  the  Off-Nominal  H-i  System;  2 
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Figure  6.10.  Closed-Loop  Poles  of  the  Off-Nominal  i/j  System:  3 
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H2  Closed-Loop  Near  Poles  for  Nominal  Plant  (Kma  at  max,  Kg  at  min) 
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Figure  6.11.  Closed-Loop  Poles  of  the  Off-Nominal  System:  4 
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is  a  problem  for  the  closed-loop  system  in  several  configurations.  If  the  stability  problem  was 
caused  by  neglected  dynamics,  a  solution  to  robust  stability  would  be  to  reduce  the  bandwidth 
of  the  system.  This  could  be  accomplished  by  either  further  penalizing  the  control  effort  or 
reducing  the  weights  on  the  end  effector  states,  ie.,  demand  less  performance.  Unfortunately, 
with  parametric  uncertainty  the  low  frequency  system  poles  simply  aren’t  where  the  controller 
is  expecting  them.  Reducing  the  bandwidth  will  not  solve  our  problem.  We  will  explore  this 
issue  in  further  detail  when  considering  the  Hoc  design. 

6.3  Summary 

This  Chapter  removes  the  requirement  for  full  state  feedback  uncorrupted  by  noise  and 
presents  the  H2  (or  LQG)  approach.  This  approach  involves  designing  a  Kalman  filter  to 
estimate  all  of  the  system  states  from  the  observation  and  control  vector  and  then  using  these 
estimates  as  perfect  state  information  fed  into  a  LQR. 

On  the  nominal  plant,  the  H2  approach  is  able  to  recover  performance  comparable  to  the 
LQR  design.  Both  performance  objectives  are  met  without  exceeding  the  limits  on  the  control 
effort.  Unfortunately,  the  closed-loop  H2  system  is  not  robustly  stable  in  face  of  the  parametric 
uncertainty  present  in  the  plant  as  indicated  by  both  /r-analysis  and  the  small  gain  test.  Though 
these  tests  are  both  conservative  for  real  parametric  uncertainty,  eigenvalue  analysis  shows 
conclusively  that  the  system  is  not  robustly  stable. 
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Chapter  7 


Hoo  Design 


In  the  last  Chapter  we  designed  a  compensator  that  meets  performance  specifications  within 
allowable  control  effort  specifications  by  minimizing  the  H2  norm  of  the  transfer  function  from 
exogenous  inputs  to  the  error  vector.  This  Chapter  solves  a  similar  problem,  only  this  time 
performance  is  measured  with  an  Hoc  norm.  The  Hoc  norm  is  defined  in  the  frequency  domain 
for  a  stable  transfer  function  matrix  as 


\\Gis)\\co  = 


Using  this  norm,  an  Hoc  optimization  design  technique  will  result  in  the  compensator  that 
minimizes  the  supremum  over  ail  frequencies  of  the  maximum  singular  value  of  the  transfer 
function  of  interest. 

Recall  that  the  disturbances  acting  on  the  SRMS  are  caused  by  Shuttle  attitude  thruster 
firings.  The  Shuttle  may  fire  its  thrusters  in  pulses  every  80  milliseconds,  or  for  steps  lasting  as 
long  as  200  seconds.  Such  a  firing  profile  has  the  ability  to  excite  the  system  at  any  frequency 
up  through  its  bandwidth.  This  makes  the  Hoc  norm  a  natural  selection  for  optimization.  We 
would  like  to  reduce  the  worst  case  gain  for  disturbances  of  the  SRMS  at  all  frequencies. 

There  are  many  parallels  between  the  H2  and  Hoo  approaches.  Development  of  the  Hoo 
theory  in  Section  7.1  flows  naturally  from  the  work  of  last  Chapter.  As  in  the  H2  case,  we 
arrive  at  two  algebraic  Riccati  equations.  The  Hoo  Riccati  equations  can  be  thought  of  as 
those  that  arise  from  a  linear  quadratic  differential  game  between  two  opposing  players.  One 
player  chooses  the  disturbance  to  maximize  the  norm  of  the  output  while  the  opposing  player 


sup  ^[GOa;)] 

sun  Mi 
ll«l|j?40  ii«l|2 


(7.1) 
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Figure  7.1.  Plant  With  Compensator 

chooses  the  control  input  to  minimize  it.  The  work  in  this  Section  follows  that  of  Reference  [17]. 
Though  not  presented,  the  Hoc  design  technique  can  also  be  derived  by  recasting  the  system  as 
a  model-matching  problem  via  the  parametrization  of  all  stabilizing  controllers.  An  exposition 
of  this  technique  is  presented  in  Reference  [18]. 

Section  7.2  analyzes  the  results  of  the  Hoa  design  using  the  same  criteria  as  in  previous 
Chapters.  The  singular  values  from  disturbance  to  control  effort  and  disturbance  to  the  end 
effector  states  are  examined  and  compared  to  the  open-loop  system.  To  highlight  the  differences 
in  optimizing  the  norm  versus  the  //<»  norm,  a  comparison  of  these  two  designs  is  presented 
as  well.  As  before,  a  time  domain  simulation  of  the  same  jet  firing  pattern  is  presented. 
Finally,  stability  robustness  of  the  closed-loop  Hao  system  is  examined  in  face  of  the  parametric 
uncertainty.  By  choosing  a  different  transfer  function  to  optimize,  namely  the  output  of  the 
plant  uncertainties  to  the  input  of  the  plant  uncertainties,  we  can  determine  if  any  Hoc  controller 
would  be  able  to  robustly  stabilize  the  plant. 


7.1  H(xi  Theory 


For  the  Hoc  design  we  will  use  the  same  general  two-block  diagram  as  shown  in  Figure  7.1. 
Define  the  error  vector  the  same  as  for  the  design,  namely 


c  = 


y 


rJ 

Using  the  same  notation  as  in  the  last  Chapter,  write  the  transfer  function  from  the  exogenous 
inputs  to  the  error  vector  as 
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e  =  :F,{P,K)d  (7.2) 

The  Hoo  optimal  compensator  is  the  compensator  that  minimizes  the  supremum  over  all 
frequencies  of  the  m20cimum  singular  value  of  ^/(P, /^)  for  stabilizing  K.  This  can  be  written 
as 


^ll^/(/’,A^)lloc=  ,  min  ^8upF(/-,(P,A:)(iu;))  (7 

A  stabilizing  K  w 

In  practice  however,  contrary  to  the  problem,  Hoc  optimal  controllers  are  more  difficult  to 
characterize  than  suboptimal  ones.  We  settle  for  an  admissible  K  such  that  ||.P/ (P,  If  )||ao  ^  f- 
Before  characterizing  the  Hoo  suboptimal  controllers,  consider  the  transfer  function  matrix 


min 

stabilizing 


of  Figure  7.1  to  be  in  the  same  form  as  the  H2  problem.  That  is 


with 
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Dn  =  (0  /]■'■  and  D21  =  [0  7] 


(7.4) 


Hoo  theory  requires  that  the  following  must  hold  true  for  the  system: 


(a)  (i4,P|,Ci)  is  stabilizable  and  detectable. 


(b)  {A,  82,02)  is  stabilizable  and  detectable. 


(c)  Du’'[Cy  D,2]  =  (0  /]. 


(d) 


fli 

Dll 


Dj 


'/■ 


0 

/ 


The  weights  on  the  states  in  the  error  vector,  e,  must  be  chosen  to  insure  (a)  is  met.  As 
shown  in  Chapter  4,  (b)  is  known  to  be  true  for  SSAD.  To  meet  conditions  (c)  and  (d)  it  will  be 
necessary  to  apply  the  same  transformations  on  the  system  used  in  the  H2  problem  as  shown 
in  Section  6.1.  Implicit  in  the  given  state-space  realization  for  Figure  7.1  is  that  Dn  =  0 
and  D22  =  0.  In  a  general  H^^,  problem  these  conditions  can  be  relaxed,  though  it  would 
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complicate  the  formulas  presented  here  considerably.  A  treatment  of  the  more  general  problem 
can  be  found  in  Reference  [19]. 

The  Hoo  suboptimal  problem  is  to  find  an  admissible  K  such  that  T- 

solution  involves  two  Hamiltonian  matrices  as  in  the  problem 


^’oo  = 


y*  = 


-Ci'C,  -Ap'' 

Ap''  -Cj'Cj 

-B,Bj  -Ap 


(7.5) 

(7.6) 


These  Hamiltonians  are  remarkably  similar  to  the  formulation.  The  main  difference  is 
that  the  additional  terms  in  the  upper-right  entry  of  both  Foe  and  Joe  make  the  two  Hamil¬ 
tonians  no  longer  sign  definite.  Therefore  we  cannot  guarantee  that  either  Foe  or  Joo  has  a 
Riccati  solution  or  that  the  solution  is  positive  definite.  In  fact,  this  limitation  is  related  to  the 
existence  of  a  Hoo  suboptimal  controller. 

In  Reference  [17],  Doyle  et  al  prove  that  there  exists  an  admissible  controller,  K,  such  that 
(P, /f)||j^  <  y  iff  the  following  conditions  hold  true 

(i)  Foe  €  dom(Ric)  and  Xoo  •—  Ric(Foo)  >  0. 

(ii)  Joo  €  dom(Ric)  and  y<»  :=  Ric(Joc)  >  0. 

(iii)  piXooYoo)  <  7*  where  p{M)  is  the  spectral  radius  of  M. 

When  all  three  of  these  conditions  hold,  one  such  suboptimal  controller  is 


where 


^*«*(<) 


■^00 

Boo 

Coo 

0 

(7.7) 


Aoo  =  Ap  +  "f~^B^  Bi '  Xoo  +  B2C00  —  B00C2  (7 .8) 

Boo  =  (/-7"’V'ooA-«,)”V«,C2''‘ 

Coo  =  -B2''Xoo 


99 


Conditions  (i)  through  (iii)  suggest  the  following  method,  known  as  7  iteration,  to  calculate 
an  Hoo  ‘optimal’  controller:  select  a  positive  number  7;  test  conditions  (i)  through  (iii);  increase 
7  if  failed  or  decrease  7  if  passed;  repeat  until  within  a  desired  tolerance. 

To  illustrate  the  relationship  between  the  Hoa  &nd  H2  solutions,  notice  that  as  7  — »  00, 
Xao  —*  X2,  etc.,  and  Kao  ^2-  This  can  be  seen  directly  from  the  two  sets  of  Hamiltonians 
in  the  Hoo  and  H2  problem  formulations;  as  7  — »  00  the  Hoc  Hamiltonians  become  the  H2 
Hamiltonians. 


7.2  H(X)  Results 

As  in  the  H2  problem,  the  design  parameters  of  the  Hoo  problem  are  the  choice  of  weights 
on  the  contents  of  the  error  vector,  namely  y  and  u.  In  design,  select  the  C^  weights  for  the  Hoo 
problem  as  before  when  designing  both  the  H2  and  LQR  controllers.  That  is,  multiply  each 
channel  corresponding  to  an  attitude  by  an  order  of  magnitude  to  account  for  the  difference 
between  position  and  attitude  singular  values  of  the  open-loop  system.  This  is  to  ameliorate 
the  difference  between  a  one  inch  error  and  a  one  radian  error. 

The  weights  on  the  control  effort,  D12,  are  treated  as  the  main  tuning  parameter.  The 
weight  on  each  control  channel  is  increased  or  decreased  appropriately  until  every  control  effort 
singular  value  does  not  exceed  the  corresponding  joint  rate  limit.  The  final  weights  of  the  Hoo 
design  need  not  correspond  to  the  weights  of  the  H2  design  since  a  different  norm  is  being 
optimized. 

Section  7.2.1  examines  the  control  effort  and  performance  singular  values  of  the  Hgo  design. 
The  performance  singular  values  are  compared  not  only  to  the  open-loop  plant  but  also  to  the 
H2  design  in  order  to  highlight  the  differences  in  the  two  design  techniques.  A  time  domain 
simulation  driven  by  the  same  jet  firing  pattern  used  on  the  other  designs  is  included  as  well  to 
help  interpret  the  singular  values.  Section  7.2.2  examines  the  stability  robustness  of  the  Hoo 
design.  By  choosing  a  different  transfer  function’s  Hoo  norm  to  minimize,  we  can  determine  if 
it  is  possible  to  design  a  robustly  stable  compensator. 

7.2.1  Hoo  Control  Effort  and  Performance 


Figure  7.2  shows  the  singular  values  from  disturbance  to  control  effort  after  tuning  the 


Figure  7.2.  Control  Effort  Singular  Values  of  the  Hoc  System 

weights  on  the  error  vector.  As  before,  each  control  channel  weight  residing  in  Di2  is  fine  tuned 
until  the  scaled  singular  value  falls  just  below  one. 

In  the  H2  solution,  the  controller  did  not  make  full  use  of  every  control  channel,  specifically 
elbow  pitch  and  wrist  pitch.  For  the  Hoc  design  a  different  norm  is  designed  for,  the  Hoc  norm, 
that  is  the  supremum  over  all  frequencies  of  the  maximum  singular  value,  not  the  area  under 
the  singular  value  curves  as  in  the  H2  case.  The  Hoc  controller  is  able  to  make  use  of  each 
control  to  its  full  potential  since  it  is  penalizing  the  control  channels  in  a  different  way. 

Figure  7.3  presents  the  singular  values  from  disturbance  to  end  effector  position  and  attitude. 
Included  on  the  same  plots  is  the  corresponding  information  for  the  open-loop  system.  The 
Hoc  design  is  able  to  meet  the  performance  goals  of  less  than  1  inch  excursion  in  position  and 
less  than  0.01  radians  of  rotation  excursion  without  exceeding  the  joint  rate  limits. 

Figure  7.4  compares  the  singular  values  from  disturbance  to  end  effector  position  and  at¬ 
titude  for  both  the  Hoc  &nd  H2  systems.  Because  the  Hqc  design  is  trying  to  minimize  the 
Hoc  norm,  it  has  sacrificed  some  performance  at  2  rps  so  that  the  maximum  singular  value  can 
be  brought  down.  The  difference  between  the  two  designs  is  not  as  dramatic  as  can  be  seen 
in  other  systems  since  the  H2  design  does  not  contain  any  large  resonances,  or  ‘hills,’  in  its 
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Magnitude  (PH:  dashed  H-inf:  solid)  Magnitude  (PH:  dashed  H>inf:  solid) 


Figure  7.3.  Performance  Singular  Values  of  the  hoc  Position  Hold  Systems 
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Magnitude  (H2:  dashed  H-inf:  solid)  Magnitude  (H2:  dashed  H-inf:  solid) 


Figure  7.4.  Performance  Singular  Values  of  the  f/oo  and  ffj  Systems 
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maximum  singular  value  to  begin  with,  it  is  already  flat.  The  Hqo  design  lowers  the  maximum 
singular  value  for  both  position  and  attitude  at  the  expense  of  some  of  the  other  directions. 

The  presence  of  the  control  effort  in  the  error  vector  acts  like  an  Hi  constraint  on  the  Hex 
design  limiting  the  bandwidth  of  the  closed-loop  system.  The  Hex  design  sans  control  effort  in 
the  error  vector  would  have  a  theoretically  infinite  bandwidth;  theoretical  because  even  if  we 
accept  exceeding  the  joint  rate  limits  to  increase  the  bandwidth,  the  actual  system  will  run  into 
physical  constraints  on  the  control  effort. 

As  with  the  previous  designs.  Figures  7.5  and  7.6  present  a  time  domain  simulation.  As 
before,  a  roll  command  is  given  for  a  full  10  seconds  immediately  followed  by  a  10  second  pitch 
command  and  finally  a  10  second  yaw  command.  These  Figures  include  the  responses  of  the 
Hex  system  as  well  as  both  the  open-loop  and  the  Hi  system. 

From  Figure  7.5  it  is  apparent  that  as  in  the  previous  designs,  the  Hex  compensator’s 
X  direction  has  poorer  disturbance  rejection  than  the  Y  and  Z  directions.  For  the  attitude 
directions,  roll  disturbance  rejection  is  better  than  the  other  axes  as  shown  in  Figure  7.6. 
Compared  to  the  Hi  system,  the  Hex  system  is  better  in  some  directions  and  slightly  worse 
in  others.  As  expected,  the  worst  case  direction  is  better  for  the  Hex  design,  namely  X  for 
the  position  directions  and  pitch  for  the  attitude  directions.  These  results  confirm  the  singular 
value  plots  that  show  the  Hex  design  offers  a  better  worst  case  direction  at  the  expense  of 
some  of  the  other  directions.  Nevertheless,  both  controllers  are  adequate  to  achieve  our  desired 
performance  objectives. 

7.2.2  Hex  Stability  Robustness 

Figure  7.7  shows  the  results  of  applying  both  the  small  gain  test  and  //-analysis  to  the  closed  - 
loop  Hex  system.  As  with  every  other  design,  the  Hex  system  fails  both  stability  robustness 
tests.  Experience  with  the  other  designs  in  previous  Chapters  show  that  these  tests  may  be 
poor  indicators  of  the  system’s  true  robust  stability. 

Eigenvalue  analysis  of  both  the  open-loop  system  and  the  Hi  design  show  conclusively 
that  these  two  systems  are  not  robustly  stable  in  face  of  the  parametric  uncertainty  present 
in  the  model.  However,  for  the  LQR  design  the  closed-loop  system  is  stable  for  four  sets  of 
the  parameters  at  their  endpoints.  Though  this  is  not  undisputable  proof,  it  suggests  that  it 
is  possible  the  LQR  closed-loop  system  is  indeed  robustly  stable.  This  is  a  crude  method  for 
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Figure  7.5.  TVanslational  Response  of  PH,  and  Hga  Systems  to  Attitude  Thruster  Com¬ 


mands 
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Figure  7.6.  Rotational  Response  of  PH,  H2,  and  i/00  Systems  to  Attitude  Thruster  Conunands 
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Figure  7.7.  Robust  Stability  Analysis  of  the  System 

examining  robust  stability;  however  the  nature  of  our  system  leaves  few  options.  Though  the 
Hoo  system  fails  both  stability  robustness  tests,  let  us  examine  the  eigenvalues  of  the  closed-loop 
system  at  certain  endpoints  of  the  parameters  as  before. 

Figure  7.8  presents  the  near  pole  locations  for  the  nominal  closed-loop  Hao  system.  The 
following  four  Figures  show  the  closed-loop  poles  for  the  four  sets  of  perturbed  parameters. 
Figure  7.9  places  both  the  Km  a  and  the  K(}  at  their  minimum  values.  Figure  7.10  places  both 
sets  of  parameters  at  their  maximum  values.  Figure  7.11  places  the  Kma  at  their  minimum 
values  and  the  Ka  at  their  maximum  values.  Finally,  Figure  7.12  considers  the  last  permutation 
of  the  Kma  at  their  maximum  values  and  the  Ka  at  their  minimum  values.  Evaluating  these 
sets  of  parameters  for  the  closed-loop  Hoc  design,  as  performed  for  the  LQR  and  systems, 
shows  conclusively  that  the  system  is  indeed  not  robustly  stable. 

Aside  from  the  nominal  plant,  the  closed-loop  system  is  only  stable  for  system  2  shown  in 
Figure  7.10.  This  is  also  the  case  for  the  Hi  system  considered  in  Chapter  6.  Similar  to  the 
Hi  plant,  it  is  not  possible  to  draw  any  conclusions  for  the  Hoc  system  about  certain  regions 
of  operation  that  are  causing  instability  since  stability  is  a  problem  for  several  configurations. 

If  the  stability  problem  was  caused  by  neglected  dynamics,  a  solution  to  robust  stability 
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Figure  7.8.  Closed-Loop  Poles  of  the  Nominal  Hoc  System 


Figure  7.9.  Closed-Loop  Poles  of  the  Off-Nominal  Hao  System:  1 
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Figure  7.10.  Closed-Loop  Poles  of  the  Off-Nominal  Hoc  System:  2 


Figure  7.11.  Closed-Loop  Poles  of  the  Off-Nominal  Hoc  System:  3 
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Real 


Figure  7.12.  Closed-Loop  Poles  of  the  Off-Nominal  Hoo  System:  4 

would  be  to  reduce  the  bandwidth  of  the  system.  Unfortunately,  with  parametric  uncertainty 
the  system  poles  simply  aren’t  where  the  controller  is  expecting  them.  Reducing  the  bandwidth 
will  not  solve  the  problem.  Chapter  4  shows  that  the  open-loop  system  itself  is  not  robustly 
stable  in  face  of  the  parametric  uncertainty  present.  Therefore  even  a  zero  as  the  transfer 
function  for  the  controller  will  not  stabilize  the  plant.  However,  using  the  Hoc  design  process 
it  is  possible  to  design  for  robust  stability  directly  to  see  if  it  is  achievable  for  the  SSAD  plant. 

Recall  the  general  three  block  formulation  shown  in  Figure  7.13  and  its  closed-loop  repre¬ 
sentation  shown  in  Figure  7.14.  G{s)  is  the  closed-loop  transfer  function  which  combines  the 
dynamics  of  the  plant  and  the  compensator.  Recall  that  the  closed-loop  transfer  function,  G(s), 
can  be  divided  into  a  set  of  four  transfer  functions  as  follows, 

f  _  CiiC*)  Gui»)  »? 

c  Cj2i(<)  G22(«) 

Chapter  4  shows  that  if  ||Gii(s)||„o  <  1,  robust  stability  is  achieved  via  the  small  gain  test. 
Therefore,  the  Hex,  design  process  will  design  for  robust  stability  directly  with  no  consideration 
for  performance  if  the  input  and  error  vectors  are  chosen  such  that  the  transfer  function  from  d 


Figure  7.13.  Standard  Form  for  a  Compensated  Plant  with  Uncertainty 


Figure  7.14.  Closed-Loop  TVansfer  Function 
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to  e  is  Gii(s).  The  Hoo  design  methodology  will  try  to  minimize  the  //qq  norm  of  this  transfer 
function.  If  it  can  bring  the  maximum  singular  value  below  one,  by  the  small  gain  test  robust 
stability  is  guaranteed. 

Before  proceeding  on  this  robust  stability  design,  there  is  a  subtlety  to  consider.  In  order 
to  insure  that  the  rank  conditions  outlined  in  Section  7.1  are  not  violated,  the  exogenous  input 
vector,  d,  must  include  not  on/y  the  output  of  the  uncertainty  block  17,  which  replaces  the 
process  noise,  but  must  also  include  the  sensor  noise.  Similarly,  the  error  vector  e  must  include 
u  as  well  as  e.  These  are  conditions  (c)  and  (d)  from  Section  7.1.  Hence, 


■ 

* 

c 

d  = 

V 

e  = 

u 

The  subtlety  is  that  if  u  is  penalized  by  being  included  in  the  error  vector,  the  ability  to  achieve 
robust  stability  is  also  penalized  since  it  may  require  large  amounts  of  control.  Remember 
that  for  now,  performance  is  not  considered  at  all,  the  only  concern  is  to  see  if  any  controller 
can  robustly  stabilize  the  system’ .  Therefore,  the  design  process  to  achieve  a  robustly  stable 
compensator  is  to  decrease  the  penalty  on  u  until  the  maximum  singular  value  of  the  closed-loop 
system  falls  below  unity. 

After  decreasing  the  weight  on  the  control  channels  to  the  plant  starts  to  numerically 

fail  the  rank  tests.  Unfortunately,  at  this  point  the  plant  still  fails  to  achieve  robust  stability. 
Figure  7.15  presents  the  results  of  both  the  small  gain  test  and  p-analysis  on  this  *Gn’  Hoc 
design.  Though  robust  stability  is  not  achieved,  the  curves  are  at  their  lowest  levels  out  of 
every  design,  including  the  open-loop  system.  However,  do  not  think  of  this  system  as  less 
robustly  unstable  since  robust  stability  is  a  binary  test  that  this  design  fails.  Even  this  attempt 
at  robust  stability  has  come  at  enormous  cost. 

Recall  that  this  design  gives  basically  no  regard  for  performance  whatsoever.  Figure  7.16 
shows  the  scaled  singular  values  from  disturbance  to  control  effort.  The  joint  rate  limits  have 
been  exceeded  by  almost  five  orders  of  magnitude.  Figure  7.17  presents  the  singular  values 
from  disturbance  to  end  effector  position  and  attitude.  Clearly,  the  price  is  quite  high  in  the 
effort  to  achieve  robust  stability.  Unfortunately  the  goal  is  not  even  achieved. 
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Figure  7.16.  Control  Effort  Singular  Values  of  the  ‘Gn’  //<»  System 
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Singular  Values  from  Disturbance  to  End  Effector  Position 


Figure  7.17.  Performance  Singular  Values  of  the  ‘Gn’  Hex>  System 


7.3  Summary 

This  Chapter  presents  a  method  for  finding  Hoc  optimal  controllers  that  minimize,  to  a 
desired  tolerance,  the  Hoo  norm  of  the  transfer  function  from  disturbance  to  error  vector.  This 
problem  is  closely  related  to  the  design  process  which  is  minimizing  the  H2  norm  of  the 
same  transfer  function.  Both  techniques  involve  a  two  Riccati  equation  solution  process.  Both 
remove  the  requirement  for  full  state  feedback  uncorrupted  by  noise  called  for  in  the  LQR 
design. 

With  the  Hoc,  approach,  on  the  nominal  plant  we  are  able  to  achieve  performance  better 
than  the  H2  design.  Performance  is  better  in  the  sense  that  the  worst  case  direction  is  of 
less  magnitude  than  the  H2  design;  however,  this  came  at  the  expense  of  some  of  the  other 
directions.  Nevertheless,  both  performance  objectives  are  met  without  extc^-ding  the  limits  on 
the  control  effort. 

Unfortunately  the  closed-loop  //oo  system,  like  the  H2  system,  is  not  robustly  stable  in  face 
of  the  parametric  uncertainty  present  in  the  plant.  As  before  both  /i-analysis  and  the  small 
gain  test  as  well  as  actual  eigenvalue  analysis  confirm  the  results.  In  an  effort  to  achieve  a 
robustly  stable  design,  the  problem  is  recast  to  minimize  the  Hoc  norm  of  the  G^^  transfer 
function.  This  essentially  abandons  all  performance  desires  and  tries  only  to  achieve  robust 
stability.  This  process  fails  to  achieve  robust  stability  as  well. 
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Chapter  8 


Conclusions 


Three  regulator  designs  were  developed  to  improve  the  performance  of  the  Space  Shuttle 
Remote  Manipulator  System  (SRMS).  Designs  focused  on  the  Position  Hold  function  of  the 
SRMS.  Position  Hold  is  universal  to  normal  operations  of  the  SRMS.  In  Chapter  2,  the  SRMS 
system  was  examined  in  greater  detail.  A  linear  model  of  the  nonlineu  plant  was  developed 
in  Chapter  3.  Two  of  the  significant  non  linearities  were  modelled  as  parametric  uncertainties 
in  an  effort  to  capture  their  effect  on  the  system.  The  linear  model  was  compared  to  a  higher 
fidelity  nonlinear  model  in  the  time  domain. 

In  Chapter  4,  the  characteristics  of  the  open- loop  plant  were  explored.  Because  of  the 
numerical  problems  associated  with  six  integrator  poles,  subsequent  regulator  designs  used 
the  current  Position  Hold  system  as  the  open-loop  plant.  In  this  manner,  the  current  system 
moved  the  integrator  poles  off  the  ju;-axis.  In  effect,  the  new  designs  operate  as  stability  and 
performance  augmentation  around  the  current  system.  Model  reduction  was  applied  to  the  new 
open-loop  plant. 

The  performance  and  control  effort  of  the  open-loop  system  were  presented  in  Chapter  4  as 
a  baseline  with  which  to  compare  subsequent  regulator  designs.  The  performance  and  control 
effort  criteria  were  analyzed  in  the  frequency  domain  by  looking  at  three  sets  of  singular  values: 
disturbance  to  control  effort,  disturbance  to  end  effector  position,  and  disturbance  to  end  effec¬ 
tor  attitude.  A  typical  Shuttle  attitude  control  jet  firing  pattern,  which  acts  as  a  disturbance 
on  the  system,  was  simulated  in  the  time  domain. 

Also  in  Chapter  4,  a  sufficient  condition  for  robust  stability  was  developed  using  the  small 
gain  theorem  with  an  Hoa  norm.  The  structured  singular  value,  ft,  was  defined  and  shown  to 
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provide  a  necessary  and  sufficient  condition  for  robust  stability.  In  calculation,  /i-analysis  is 
reduced  to  a  sufficient  condition,  but  it  is  less  conservative  than  the  small  gain  test.  Unfortu¬ 
nately  these  tests  are  conservative  for  lightly  damped  systems  with  real  parametric  uncertainties 
-  the  exact  description  of  the  SRMS  model.  Application  of  these  tests  on  the  current  system 
determined  it  to  be  unstable.  As  confirmation,  the  eigenvalues  of  the  system  were  calculated 
for  certain  values  of  the  uncertain  parameters.  For  conditions  corresponding  to  low  amplitude, 
slow  motion,  the  current  system  was  found  to  indeed  be  unstable.  This  corresponds  to  the  slow, 
low  amplitude  limit  cycling  observed  in  the  actual  system. 

A  Linear  Quadratic  Regulator  (LQR)  was  designed  in  Chapter  5.  Its  performance  and 
control  effort  were  a  considerable  improvement  over  the  current  system.  Though  the  LQR 
failed  both  the  small  gain  test,  and  ^-analysis,  actual  eigenvalue  analysis  seemed  to  indicate 
that  the  system  may  be  robustly  stable.  Unfortunately  this  cannot  be  proved  unequivocally. 
Since  full-state  feedback  is  not  available  on  the  SRMS,  two  model  based  compensators  were 
designed. 

In  Chapter  6  the  H2  technique  and  in  Chapter  7  the  Hx  technique  were  presented.  There 
are  many  parallels  between  these  two  techniques.  In  both,  an  error  vector  is  constructed  from 
the  controls  and  some  linear  combination  of  the  states.  As  the  name  implies,  the  H2  technique 
designs  an  optimal  compensator  that  minimizes  the  H2  norm  of  a  transfer  matrix  using  only  the 
system  measurements.  Similarly,  the  technique  minimizes  the  norm.  Both  designs  were 
able  to  improve  the  performance  of  the  nominal  system  considerably.  However,  neither  design 
achieved  robust  stability.  Not  only  did  both  designs  fail  the  small  gain  test  and  p-analysis,  but 
actual  eigenvalue  analysis  showed  the  systems  were  indeed  not  robustly  stable. 

In  an  effort  to  achieve  a  robustly  stable  design.  In  Chapter  7  the  problem  was  recast  to 
minimize  the  Hoc  norm  of  the  G\\  transfer  function.  By  the  small  gain  test,  if  the  Hoc  norm 
of  this  transfer  function  falls  below  one,  robust  stability  is  assured.  This  essentially  abandons 
all  performance  desires  and  tries  only  to  achieve  robust  stability.  Unfortunately,  this  process 
fails  to  achieve  robust  stability  as  well.  This  modified  design  did  establish  the  trade-off  present 
in  the  pursuit  of  robust  stability  at  the  expense  of  performance  and  control  effort. 

If  the  stability  problem  was  caused  by  neglected  dynamics,  a  solution  to  robust  stability 
would  be  to  reduce  the  bandwidth  of  the  system.  This  could  be  accomplished  by  either  further 
penalizing  the  control  effort  or  reducing  the  weights  on  the  end  effector  states,  ie.,  demand 
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less  performance.  Unfortunately,  with  parametric  uncertainty  the  low  frequency  system  poles 
simply  aren’t  where  the  controller  is  expecting  them.  Reducing  the  bandwidth  will  not  solve 
the  problem.  Considering  the  open-loop  system  itself  is  not  robustly  stable,  even  a  zero  for  the 
compensator  will  not  make  the  system  robustly  stable. 

A  possible  extension  for  this  thesis  would  be  to  gain  schedule  the  controllers  over  certain 
regions  of  the  nonlinearities.  In  this  manner,  the  uncertainty  of  the  parameters  would  be  reduced 
allowing  a  family  of  controllers  to  be  robustly  stable  over  their  respective  regions.  A  ‘real  fi- 
analysis’  technique  would  aid  the  design  process  allowing  for  the  removal  of  conservatism  in  the 
gain  scheduled  controllers.  Additionally,  the  model  can  be  modified  to  incorporate  acceleration 
sensors  or  optical  measurements  to  provide  for  different  measurements  when  NASA  chooses  a 
new  sensor  suite. 

The  SRMS  model  suffers  from  lightly  damped  poles  that  have  large  real  parametric  un¬ 
certainty.  To  achieve  a  single  robustly  stable  design  a  ‘real  p-synthesis’  technique  is  needed. 
This  technique  must  not  only  use  the  information  of  the  structure  of  the  uncertainty,  but  also 
account  for  exclusively  real  perturbations.  The  SRMS  plant  is  an  ideal  candidate  for  a  real 
^-synthesis  technique. 
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